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DEFECTS IN DRUG MET ABOLISM 
FTBT.D OF TWK TNVENTION 

The invention relates to genetic material, 
specifically primers, for use in a method designed to 
5 determine the genotype of an individual; and also a kit, 
including the genetic material of the invention, for 
^ performing the method of the invention. 

tt ft<7«;ftROPND OF TffB TNVENTION 

10 jt is well known that genetic polymorphisms in 

drug metabolizing genes give rise to a variety of 
phenotypes. This information has been used to advantage 
in the past for developing genetic assays that predict 
phenotype and thus predict an individual's ability to 

15 metabolize a given drug. The information is of particular 
value in determining the likely side effects and 
therapeutic failures of various drugs. The availability 
of this sort of information will result in routine 
phenotyping being recommended for certain categories of 

20 patients. 

Drug metabolism is carried out by. the cytochrome 
P450 family of enzymes. For example, the cytochrome P450 
isozyme gene, CYP2C9 encodes a high affinity hepatic 
[S] -warfarin 7-hydroxylase which appears to be principally 

25 responsible for the metabolic clearance of the most potent 
enantiomer of warfarin. Similarly, the cytochrome P450 
isozyme gene, CYP2A6, encodes a protein that metabolizes 
nicotine and coumarin and activates the tobacco-specific 
nitrosamine 4- (methyinitrosamino) -1- ( 3-pyridyl) - 

30 l-butanone) (NNK) . 

It is of note that the above gene products are 
also known to metabolize other substrates, for example, 
the CYP2C9 gene product is also known to metabolize 
Tolbutamide, Phenytoin, Ibuprofen, Naproxen, Tienilic 

35 acid, Diclofenac and Tetrahydrocannabinol. 

It follows that genetic polymorphisms or 
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mutations in either of the two aforementioned genes can 
lead to an impairment in metabolism of at least the 
aforementioned drugs. 

In so far as CYP2C9 is concerned, sequences 
reported by Yasumori et al (1987 J. Biochem. 102:1075- 
5 1082.) and Kimura et al (1987 Nuc. Acids Res. 15:10053- 

10054) show differences at several positions including a C 
to T base change that results in a Arginine/Cysteine 
polymorphism at amino acid 144 This polymorphism has 
been designated R144C. 

10 In so far as CYP2A6 is concerned, a T to A base 

change at position 488 of the cDNA sequence described by 
Yamano et al (1990 Biochemistry 29:1322-1329) results in 
substitution of Leucine 160 by Histidine. Henceforth this 
mutant form of the gene will be designated CYP2A6vl. 

15 The variant CYP2A6V1 encodes an enzyme that is 

unstable and catalytically inactive. It is found in the 
general population at a frequency of about 1% but does not 
account for all slow metabolizers of coumarin. 

Since the cDNA sequence structure of CYP2C9 and 

20 CYP2A6 are known, and since it is also known to perform 

genetic assays to determine whether a preselected mutation 
is present within a given gene, it should, in theory, be 
possible to design assays which specifically determine 
whether either of the aforementioned mutations are present 

25 in each of the respective aforementioned genes. 

However, we have found an extraordinarily high 
degree of exon homology in the cytochrome P450 genes. 
This has resulted in non-specific binding of assay 
materials and poor performance of assays. In the instance 

30 where primers have been used to hybridize to genetic 

material, non-specific binding of such primers has taken 
place, and in the further instance where primers have been 
used to hybridize to genetic material with a view to 
performing a polymerase chain reactions we have found that 

35 related genes have also been amplified, for example, 
CYP2A7, CYP2A12 and CYP2C8 have also been amplified. 
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SUMMARY OF THE INVENTION 

The present invention relates to novel variant 
alleles in cytochrome P450 genes which express enzymes 
involved in the metabolism of particular drugs and/or 
chemical carcinogens. 

* One object of the present invention relates to 

the discovery of new mutant or variant CYP2A6 alleles 
wherein the human gene is characterized. A new variant 
allele has been found which is designated CYP2A6v2. The 
cDNA and genomic sequence of CYP2A6V2 is provided in the 

10 present invention* Another new gene related to 

CYP2A6 has been discovered and is designated CYP2A13 . The 
cDNA and genomic sequence of CYP2A13 is provided in the 
present invention. 

Another object of the present invention relates 

15 to the use of intron sequences to specifically identify 
CYP2A6 and CYP2C9 variants in a gene specific detection 
assay. 

Another object of the present invention is to 
use an oligonucleotide probe, specific for regions unique 
20 to a particular CYP2 variant to screen for the presence or 
absence of the variant in a sample. 

Yet another object of the invention is to 
provide genetic material, a method, and a kit which enable 
genotyping of the CYP2C9 and CYP2A6 gene with a view to 
25 providing phenotypic information concerning drug 
metabolism. 

A further object of the present invention 
provides a method for diagnostically determining the 
sensitivity of a patient for specific drugs and chemical 
30 carcinogens. Such a method is widely applicable in 

determining the proper dosage of a drug for a patient. 

Another object of the present invention provides 
a method of genotyping CYP2A6 and CYP2C9 and determining 
whether a mutation has altered the sequence of these genes 
35 and hence altered sensitivity to particular drugs and 
chemical carcinogens. 
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In accordance with the present invention a 
method is provided which utilizes the finding that each 
variant of a CYP2 gene has specific nucleotide differences 
as compared with the wild-type CYP2 gene. Such nucleotide 
changes can be utilized in a probe-hybridization assay, 
which is capable of specifically detecting a chosen 
variant and not other variants. 

The present invention also provides a genotyping 
method for identifying the presence or absence of a 
mutation at codon 144 of the coding sequence of CYP2C9, or 

10 alternatively, at codon 160 of the coding sequence of 
CYP2A6, or alternatively , a gene conversion event 
involving CYP2A6 and CYP2A7 in exons 3, 6 or 8 comprising 
use of a portion of DNA. Such a mutation is then 
correlated to the sensitivity of particular drugs and 

15 chemical carcinogens. 

The present invention further relates to a gene- 
specific bioassay which is capable of distinguishing 
between the CYP2 genes and identify the presence or 
absence of a mutation in CYP2A6 and CYP2C9 genes. Such a 

20 bioassay can diagnostically predict the sensitivity of an 
individual to particular drugs or chemical carcinogens. 
For example, the CYP2C9 variants identify a sensitivity to 
a commonly used anti-coagulant drug, warfarin. The CYP2A6 
variants identify sensitivity to coumarin, nicotine and 

25 nitrosamines. The sensitivity to nicotine may be used to 
predict a predisposition to tobacco-related diseases , a 
propensity to smoking and adverse reactions to exposure to 
nicotine. Further, CYP2A6 genes are associated with the 
activation of nitrosamines, elevated levels of which have 

30 been correlated with many cancers. 

The present invention also provides a method of 
genotyping the CYP2A6 and CYP2C9 genes using ailele- 
specific amplification reaction. 

In addition, a highly-specific combination 

35 genotyping bioassay has been developed to identify 

mutations within CYP2A6 and CYP2C9 which are linked to 
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sensitivity to particular drugs and chemical carcinogens. 
This combination bioassay comprises a gene-specific 
amplification reaction, an exon-specif ic amplification 
reaction and an endonuclease cleavage reaction wherein 
only one form, either mutant or wild-type is cleaved, 
5 producing either a single nucleic acid fragment or 
multiply nucleic acid fragments depending upon the 
presence or absence of the mutation. For example, one 
CYP2C9 variant, R144C, which contains a C 472 ->T mutation can 
be identified by an Avail restriction site. CYP2A6 
10 variants can also be identified by their corresponding 

mutations. CYP2A6vl which contains a T 4B8 -A mutation can 
be identified by a Xcml restriction site. CYP2A6v2 which 
contains a T 415 -A mutation can be identified by a Ddel 
restriction site. 
15 The present invention also relates to a method 

for screening patients for drug sensitivity prior to their 
treatment with that drug, thereby alerting a physician of 
a drug sensitivity. In addition, the method may be used to 
screen patients for a predisposition to cancers related to 
20 excessive nitrosamine activation, which are associated 

with mutations within the CYP2A6 gene locus. Further, the 
method may be used to screen patients for a sensitivity to 
chemical carcinogens, based upon the genotype of the 
CYP2A6 and/ or CYP2C9 alleles. 
25 one such new allele variant, CYP2A6V2, has 98% 

nucleotide similarity and 80% amino acid similarity with 
the wild type CYP2A6, respectively. The present invention 
relates to the new CYP2A6v2 variant, the cDNA sequence and 
its genomic sequence wherein the alterations in sequence 
30 are within exons 3, 6 and 8, which are attributed to a 
gene conversion. In addition, another new gene, also 
involved in drug metabolism has been identified, and has 
been designated CYP2A13. This gene plays a similar role 
in drug metabolism as CYP2A6. These new gene sequences or 
35 fragments thereof are used as probes in identifying 
specif ic CYP2 variants in samples. In additions, 
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fragments of the new genes are used as primers in a 
genotyping assay. 

The invention further provides isolated CYP2AV2 
and CYP2A13 cDNAs for use in gene therapy and replacement 
protocols for individuals who are predisposed to 
5 sensitivity to needed drugs or to chemical or 
environmental carcinogens. 

In accordance with an aspect of the present 
invention, there are provided primary human cells which 
are genetically engineered with CYP2A6V2 or CYP2A13 DNA 

10 (RNA) which encodes a therapeutic agent of interest, and 
the genetically engineered cells are employed as a 
therapeutic agent. (The term "therapeutic," as used 
herein, includes treatment and/ or prophylaxis.) 

Gene expression in an organism in accordance 

15 with the practices of this invention is regulated, 

inhibited and/or controlled by incorporating in or along 
with the genetic material of the organism non-native DNA 
which transcribes to produce an RNA which is complementary 
to and capable of binding or hybridizing to a mRNA 

20 produced by a gene located within said organism. Upon 

binding to or hybridization with the mRNA, the translation 
of the mRNA is prevented. Consequently, the protein coded 
for by the mRNA is not produced. In the instance where 
the mRNA translated product, e.g. protein, is vital to the 

25 growth of the organism or cellular material, the organism 
is so transformed or altered such that it becomes, at 
least, disabled. 

Accordingly, in the practices of this invention 
from a genetic point of view as evidenced by gene 

30 expression, new organisms are readily produced. Further, 
the practices of this invention provide a powerful tool or 
technique for altering gene expression or organisms 
through gene therapy. The practices of this invention may 
cause the organisms to be disabled or incapable of 

35 functioning normally or may impart special properties 

thereto. The DNA of CYP2A6V2 or CYP2A13 employed in the 
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practices of this invention can be incorporated into the 
treated or effected organisms by direct introduction into 
the nucleus of a eukaryotic organism or by way of a 
plasmid or suitable vector containing the special DNA of 
this invention in the case of a procaryotic organism. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Embodiments of the invention are described by 
way of example only with reference to the accompanying 
10 figures wherein; 

Fig. 1. Shows the sequence of exon 2, intron 2 
and exon 3 of CYP2C8 and CYP2C9, cDNA sequences (from 4) 
are shown at the top of the page together with sequences 
from 6 genomic clones encompassing exon 2, intron 2 and 
!5 exon 3 of CYP2C8 and CYP2C9. The position of the 

polymorphism at codon 144 of CYP2C9 and the PCR primers 
are indicated. 

Fig. 2. Shows the sequence of intron 2, exon 3 
and intron 3 of CYP2A6, CYP2A7 and CYP2A12. The position 
20 of the polymorphism at codon 160 in CYP2A6 and the PCR 
primers are indicated. 

Fig. 3. Shows the detection of CYP2C9 Arg U4 Cys 
polymorphism by PCR. Following amplification, samples 
were digested with Avail and analyzed on a 1.8 % agarose 
25 gel . Lane I and lanes 3 to 6 show homozygous wild-type 
subjects, lane 2 a heterozygous individual and lane 7 
undigested PCR product. 

Fig. 4. Shows detection of CYP2A6 Leu 16Q . His 
polymorphism by PCR. Two parallel PCR reactions were 
30 carried out and the products analyzed on a 1 % agarose 

gel. Lanes 1, 3, 5 and 7 show the results of the wild-type 
specific assay and lanes 2, 4, 6 and 8 the results of the 
variant-specific assay for the same four subjects. 
Subjects I and 2 (lanes 1-4) are homozygous wild-type, 
35 subject 3 (lanes 5 and 6) heterozygous and subject 4 
(lanes 7 and 8) homozygous for the mutation. 
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Fig. 5. Shows distribution of the weekly 
maintenance doses for warfarin in patients (n=57) 
homozygous for the CYP2C9 wild-type allele (open bars) and 
heterozygous (n=37) for the R144C mutant allele (solid 
bars) • Arrows show the median weekly dose requirement of 
5 warfarin for each genotype. 

Fig. 6. Represents 7-hydroxylation of coumarin 
(%) in a family genotyped for the CYP2A6 and CYP2A6vl 
alleles, showing a subject homozygous for the CYP2A6vl 
allele who is deficient in coumarin 7-hydroxylation. 

10 Fig. 7. Shows the difference between the genomic 

and cDNA sequences for the CYP2A6 gene. 

Figs. 8a and b. Shows the conversion event which 
leads to the CYP2A6V2 allele. 

Figs. 9a through 9c. Shows the detection of 

15 CYP2A6V2 by PCR. (Fig. 9A) gene-specific amplification by 
PCR of the CYP2A6 gene using E3F and E3R. Lanes 1 to 4 
show the 7.8 Kb band obtained from several representative 
human genomic DNA templates, lane 5 correspond to a 
negative control in the absence of template and lane 6 

20 contains 1 Kb DNA ladder (GIBCO BRL) as six markers. 

(Fig. 9B) Exon-specific PCR amplification of exon 3 from 
the 7.8 Kb long-PCR product and restriction endonuclease 
pattern obtained after digestion with Xcml (left) and Ddel 
(right) to detect the CYP2A6vl and CYP2A6v2 alleles, 

25 respectively. The genotypes shown correspond to: wild type 
(+/+) r heterozygous (+/-) and homozygous (-/-) subjects. 
(C) The genotyping strategy which has been developed. 
Exons are indicated by boxes. The position of the 
corresponding primer pairs are indicated by horizontal 

30 arrows . Xcml and Ddel restriction sites generate 

digestion patterns for the different alleles having 
fragment sizes as shown. 

Fig. 10. Schematic diagram depicting 
methodology underlying a CYP2C9 genotyping assay. 

35 Fig. 11. CYP2A6V2 cDNA sequence. 

Fig. 12. CYP2A6V2 genomic DNA sequence having 
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7216 base pairs. 

Fig. 13. CYP2A13 cDNA sequence. 

Fig. 14. CYP2A13 genomic DNA sequence having 

8779 base pairs. 

Fig. 15. Agarose minigel electrophoresis of PCR 
5 products. The CYP2C9 wild-type allele (Arg-144) and R144C 
respectively, Lanes marked »+/+" and "+/-" contain 
homozygous wild types and heterozygotes respectively, the 
right-hand lane contains a 100 bp ladder. 

nPTATT^D DESCRIP TION OF THE INVENTION 
10 The cytochrome P450 isozyme gene, CYP2C9 encodes 

a high affinity hepatic [S]-warfarin 7-hydroxylase which 
appears to be principally responsible for the metabolic 
clearance of the most potent enantiomer of warfarin along 
with metabolizing a number of other drugs and chemical 
15 carcinogens. Similarly, the cytochrome P450 isozyme gene, 
CYP2A6, encodes a protein that metabolizes nicotine, 
coumarin and a host of other drugs and chemical 
carcinogens CYP2A6 also activates the tobacco-specific 
nitrosamine 4- (methylnitrosamino) -1- (3-pyridyl) - 
20 i-butanone (herein referred to as "NNK") . Many cancers 
have been associated with activation and/or accumulation 
of nitrosamines. The present invention allows detection of 
a predisposition to such cancers. 

It is of note that the above gene products are 
25 also known to metabolize other substrates. For example, 
the CYP2C9 gene product is also known to metabolize 
Tolbutamide, Phenytoin, Ibuprofen, Imipramine, Naproxen, 
Tienilic acid, Diclofenac and Tetrahydrocannabinol and 
hence can also be used to detect sensitivities to these 
30 drugs. A list of CYP2C9 drug substrates has been 
documented and is incorporated herein by reference 
(Gonzalez & Idle 1994 Clin. Pharmacokinet 26:59-70). 
Hence, the present invention can be used to screen for 
sensitivities to these drugs. 
35 m addition, CYP2C9 has been associated with the 

metabolism of chemical carcinogens, such as polycyclic 
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aromatic hydrocarbons. For example, the most ubiquitous 
environmental carcinogen, benz-[a]-pyrene is metabolized 
by CYP2C9. Benz-[a]-pyrene is found in tobacco, barbecued 
meats, car exhaust and generally, in polluted air. This 
compound, as it accumulates in the body becomes a potent 
5 DNA intercalating agent, ultimately resulting in cell 

transformation and the formation of tumors. The present 
invention provides a diagnostic method of screening 
individuals for their ability to metabolize and hence 
inactivate benz-[a]-pyrene. For example, a homozygote 

^ wild-type CYP2C9 individual would be better able to 
tolerate high levels of benz-[a]-pyrene than a 
heterozygote of the CYP2C9 allele. 

Similarly, the CYP2A6 allele is associated with 
drug sensitivity and carcinogen metabolism. Coumarin 

15 sensitivity is directly related to the presence of a 

variant CYP2A6 allele, such as CYP2A6vl, CYP2A6V2 and also 
CYP2A13. Coumarin is a drug used in treatment of 
neoplastic diseases, such as lymphomas. (See flartindale; 
The Extra Pharmacopoeia 1993 Ed. Reynolds, J.E.F., The 

20 Pharmaceutical Press, London, p. 1358). Its suggested 

dosage is very high. Therefore, the present invention is 
useful in determining a patient's sensitivity to the drug 
in order to prescribe a proper dosage and avoid toxicity. 
Another drug, Thiotepa*, is used in the 

25 treatment of a variety of neoplastic diseases, such as in 
treating women with breast cancer and children with brain 
tumors. Thiotepa is metabolized by CYP2A6 into Tepa, 
which is an intermediate more therapeutically potent than 
Thiotepa. Therefore, if a patient has a very active 

30 CYP2A6 enzyme, it is likely the patient will require lower 
doses of Thiotepa to provide a therapeutically effective 
amount. As one can see, the dosage provided to a patient 
is dependent upon the rate a patient is capable of 
metabolizing activating the drug. The present invention 

35 has identified variant alleles whose enzymatic activity is 
compromised. In addition, the present invention provides 
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a simple method of genotyping patients for Thiotepa drug 
sensitivity. With information concerning patient 
sensitivity to such drugs, the proper dosage can be 
provided, hence maximizing drug efficiency and minimizing 
drug toxicity. 

5 Further, CYP2A6 has been associated with 

nicotine metabolism. In addition to being an active 
ingredient in tobacco, nicotine also has several clinical 
uses. Nicotine is used clinically to treat various 
neurological disorders, such as Parkinson's disease and 

10 Alzheimer's disease. In addition, nicotine is used to 

treat tobacco addiction. In all of these situations, it 
is important to know a patient's sensitivity to nicotine, 
since extremely sensitive patients will become violently 
ill upon administration of nicotine. Therefore the 

15 present invention provides a method of identifying 

nicotine-sensitive patients by genotyping a patient's 
CYP2A6 allele. The present invention also provides a 
convenient method for determining an individual's general 
predisposition to using tobacco based upon their 

20 sensitivity to nicotine. 

In addition, CYP2A6 is involved in activating 
nitrosamines, thereby producing the potent carcinogen NNK. 
Increased levels of NNK have been associated with a 
variety of cancers, including but not limited to lung 

25 cancer, nasal-pharynx cancers, throat cancers and colon 
cancers. In general, elevated levels of CYP2A6 has been 
associated with cancers associated with exposure to 
nitrosamines. The present invention may detect a 
patient's predisposition to such cancers. The presence of 

30 a CYP2A6 gene or a variant thereof will affect the 

likelihood that procarcinogens present in tobacco smoke 
will be activated into carcinogenic nitrosamines and 
nitrosamine-derivatives and therefore result in the 
development of a cancer. 

35 it follows that genetic polymorphisms or 

mutations in either of the two aforementioned genes can 



WO 95/34679 



PCT/DS95/07605 



- 12 - 

o 

lead to an impairment in metabolism of at least the 
aforementioned drugs and chemical carcinogens. 

The present invention relates to the 
identification of the absence or presence of mutations in 
CYP2C9 and CYP2A6 and thus predict the phenotype of an 
5 individual and so predict whether and how an individual is 
likely to metabolize particular drugs and chemical 
carcinogens. For instance, the R144C mutation arising 
from a C 472 -+T base substitution in the CYP2C9 gene results 
in a reduction in warfarin metabolism. This implies that 

10 patients with this mutation receiving warfarin require a 
lower dose to maintain an anticoagulation target than 
those patients who do not have the mutation and are also 
receiving warfarin. Conversely, homozygous wild- types 
require higher doses in order to maintain an 

15 anticoagulation target. 

"Mutation" , as the term is used herein denotes 
an allelic variation of a known sequence, which alters the 
expressed gene product's activity. Such a variation need 
not completely inactivate the gene product's activity but 

20 merely alter it. 

Similarly, one mutation within CYP2A6vl arising 
from a T 488 -*A base change results in substitution of 
Leucine 160 by Histidine. Another CYP2A6 variant, 
CYP2 A6v2 , has been identified which differs from CYP2A6 in 

25 the regions of exons 3, 6 and 8. One particular mutation 
in CYP2A6v2, T 415 ->A mutation is useful in the assay of the 
present invention. These substitutions are very useful in 
detecting predispositions to cancers associated with 
tobacco and activation of nitrosamines. The normal CYP2A6 

30 enzyme functions in the metabolism of nicotine, one of the 
carcinogenic compounds in tobacco. 

In addition, the present, invention relates to 
the identification of a new variant of CYP2A6 designated 
CYP2A6V2. The variations of CYP2A6V2 from CYP2A6 bear 

35 sequence relatedness with the corresponding exons of the 

CYP2A7 gene, suggesting a recent gene conversion. The cDNA 
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and genomic sequence for this gene is provided in the 
present invention. Hence, at least three different 
allelic variants of CYP2A6 exist and are illustrated in 
Figure 8. These allelic variants include CYP2A6, CYP2A6V1 
and CYP2A6V2. 

5 Further, the present invention relates to a new 

CYP2A gene, designated CYP2A13 . This gene produces an 
inactive form of CYP2A6, however variants at particular 
positions, including amino acid positions 117, 209 and 365 
produce an enzyme which may alter the enzyme's activity 
10 and hence affect drug sensitivity. These mutations in 
CYP2A6 are likely to result in a deficiency or impaired 
activity of one of the enzymes responsible, for example, 
for metabolizing drugs, nicotine and nitrosamines . 

CYP2A13 is considered a new cytochrome P450 
15 gene. However, since the CYP2A13 gene product has a 

similar function as the CYP2A6, it is discussed herein as 
a variant of CYP2A6. That is, assays using the specific 
mutated amino acid positions 117, 209 and 365 of CYP2A13 
and detecting variations at those positions are indicative 
20 of CYP2A6-like variant functions. 

In one embodiment, the CYP2A6v2 or CYP2A13 
proteins or functional portions thereof are expressed as 
recombinant genes in a cell, so that the cells may be 
transplanted into an individual in need of gene therapy 
25 due to the predisposition to a carcinogen-associated 
cancer or a sensitivity to a drug. To provide gene 
therapy to an individual, a genetic sequence which encodes 
for all or part of the CYP2A6V2 or CYP2A13 ligands are 
inserted into vectors and introduced into host cells. 
30 Examples of vectors that may be used in gene therapy 

include, but are not limited to, defective retroviral, 
adenoviral, or other viral vectors (see, e.g., Mulligan, 
R.C., 1993, Science , 260:926-932). The means by which the 
vector carrying the gene may be introduced into the cell 
35 includes, but is not limited to, microinjection, 

electroporation, transduction, or transfection using DEAE- 



WO 95/34679 



PCT/US95/07605 



- 14 - 

o 

dextran, lipofection, calcium phosphate or other 
procedures known to the skilled routineer (see, e.g., 
Sambrook et. al. (Eds.)/ 1989, In "Molecular Cloning. A 
Laboratory Manual", Cold Spring Harbor Press, Plainview, 
New York) • Examples of cells into which the vector 
J carrying the gene may be introduced include, but are not 
limited to, continuous culture cells, such as COS, 
NIH/3T3, and primary or culture cells of the relevant 
tissue type. 

More specifically, there is provided a method of 

10 enhancing the therapeutic effects of blood cells, that are 
infused in a patient, comprising: (i) inserting into the 
blood cells of a patient a DNA (RNA) segment encoding 
CYP2A6v2 or CYP2A13 gene product that enhances the 
therapeutic effects of the blood cells; and (ii) 

15 introducing cells resulting from step (i) into the patient 
under conditions such that the cells resulting from step 
(i) "target" to a tissue site. In the alternative, as 
previously described the cells are not "targeted" and 
functions as a systemic therapeutic. The genes are 

20 inserted in such a manner that the patient's transformed 
blood cell will produce the agent in the patient's body. 
In the case of antigen-specific blood cells which are 
specific for an antigen present at the tissue site, the 
specificity of the blood cells for the antigen is not lost 

25 when the cell produces the product. 

Alternatively, as hereinabove indicated, 
CYP2A6v2 or CYP2A13 DNA (RNA) may be inserted into the 
blood cells of a patient, in vivo, by administering such 
DNA (RNA) in a vehicle which targets such blood cells. 

30 Further details regarding methods of gene 

therapy are provided in Anderson et al., U.S. Patent No. 
5,399,343 which is herewith incorporated herein by 
reference. 

In another embodiment, antisense CYP2A6v2 or 
35 CYP2A13 DNA or RNA may be used to control the expression 
of CYP2 gene. For example, antisense therapy may be used 
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to control CYP2A6 / s ability to activate dangerous 
nitrosamines by curbing its expression. Methods of 
producing such antisense molecules are described in U.S. 
Patent No. 5,190,931, which is incorporated herein by 
reference. 

Developing a genotyping assay, which could 
distinguish the CYP2 genes of interest from other 
cytochrome P450 genes required careful engineering since 
these genes have a high degree of sequence homology. To 
overcome this problem, one embodiment of the present 
invention has elucidated the genomic sequence structure of 
CYP2C9 and CYP2A6 with a view to making, in part, intron 
specific primers. That is to say primers which, in part, 
hybridize to at least one intron, preferably an intron 
adjacent to an exon including the mutation of interest, in 
15 the gene to be examined. Since there is less homology 

between the introns of cytochrome P450 genes, it has been 
found that using intron specific primers, gene specific 
assay can be undertaken. The present invention has a 
further advantage of using intron specific primers in so 
20 far as the use of such primers facilitates the manufacture 
of an optimum length of DNA which in turn facilitates the 
specificity of the instant bioassay. 

A "genotyping" assay as the term is used herein 
refers to any diagnostic or predictive test to detect the 
25 presence or absence of allelic variants of a known gene 

sequence at a specified gene locus. Two gene loci are of 
particular interest in the present invention, CYP2A6 and 
CYP2C9. 

Further, the present invention relates to 
30 differences between the genomic DNA sequence structure and 
the cDNA sequence structure, as illustrated in Figure 7. 
As a result, primers directed at the genomic sequence 
structure have been developed which are more reliable. 

Several methods are provided for identifying the 
35 presence or absence of a mutation at codon 144 of the 

coding sequence of CYP2C9, or alternatively, at codon 160 
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of the coding sequence of CYP2A6, or alternatively, a gene 
conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 
or 8 comprising a DNA encompassing the region of a CYP2 
gene unique to that variant. 

one such method relates to an assay which 
5 contemplates the use of one specific primer which 
specifically encompasses the region containing the 
mutation, and a second primer which is complementary to 
another portion of the gene. The second primer sequence 
chosen is based upon the CYP2A6, CYP2C9 or CYP2A13 
10 sequences as set forth in figures 12, 1 and .14, 

respectively, depending upon the preferred size of the 
amplification product. One skilled in the art will know 
how to select second primer based on the region of gene 
chosen for amplification. These primers need not be 
15 identical to a given sequence but must be sufficiently 
complementary to hybridize to the target region in a 
specific manner. In short, the primers are preferably at 
least substantially homologous to the nucleic acid 
sequence provided. 
20 Nucleic acid sequences includes, but is not 

limited to, DNA, RNA or cDNA. Nucleic acid sequence as 
used herein refers to an isolated nucleic acid sequence. 
Substantially homologous as used herein refers to 
substantial correspondence between the nucleic acid primer 
25 sequence of as described herein and that of any other 
nucleic acid sequence. Substantially homologous means 
about 50-100% homologous homology, preferably by about 70- 
100% homology, and most preferably about 90-100% homology 
between the particular sequence discussed and that of any 
30 other nucleic acid sequence. 

In the instant application, the term "primer" is 
further used to designate a molecule comprising at least 
three nucleotides, the exact length being determined by 
the requisite amount of DNA needed, under given reaction 
35 conditions, to bind to or interact with a test sample so 
as to identify the presence or absence of either of said 
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mutations. Preferably, the primer is usually between 15 
and ideally about 20 to 50 oligonucleotides in length. 

The primer is selected, or adapted, to be 
substantially complementary to a part of DNA which is 
adjacent to the region of at least one of the 

5 aforementioned mutations. Thus such a primer is able is 
hybridize with a part of DNA that contains a region in 
which the mutation of interest may be found. Although the 
primer may not reflect the exact sequence of the region in 
which the mutation is thought to occur, the more closely 

10 the primer is to this sequence, then the better the 

binding will be. Ideally, the more closely the sequence 
of the 3' end of the primer is to said region the better 
the binding or interaction will be. 

An alternative method for using the sequence 

15 unique to a variant for detection relates to use of an 
oligonucleotide probe for specifically detecting the 
presence or absence of a CYP2 variant gene in a sample, 
this method comprises the steps of contacting the sample 
with a nucleic acid probe, allowing hybridization, forming 

20 a probe: CYP2 variant complex; washing excess probe from 
probe: CYP2 variant complex; and detecting probe: CYP2 
variant complex, wherein a positive signal is an 
indication of the presence of the CYP2 variant in the 
sample. 

25 The hybridization of the probe to sample nucleic 

acids can be carried out by any of the methods commonly 
used in the art. Such methods include but are not limited 
to. Dot blot, Colony hybridization, Southern blot, 
solution hybridization and in situ hybridization. 

30 Washing the excess probe from the probe: CYP2 

variant DNA can be accomplished by many well-known 
methods. Simply rinsing the complex with excess buffer 
will facilitate removal of excess probe. Alternatively, 
washing may entail separating the probe: CYP2 variant 

35 complex from excess probe. Many methods are known to one 
skilled in the art and include but are not limited to 
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centrif ligation, filtration and magnetic force. 

According to the present invention there is 
provided a portion of DNA suitable for use as a primer in 
a method for identifying the presence or absence of a 
mutation either at codon 144 of the coding sequence of the 
gene CYP2C9, or alternatively, at least one gene 
conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 
or 8, or alternatively, at codon 160 of the coding 
sequence of the gene CYP2A6; comprising a DNA which is 
adapted to hybridize to at least one intron of at least 
one of said genes. 

In one embodiment, the method comprises the use 
of at least one restriction endonuclease to digest DNA 
from individuals to be tested. In this instance, DNA from 
individuals positive for the wild-type form of CYP2C9 
15 provide a digest with a restriction endonuclease, such as 
Avail results in production of two fragments, a first 
fragment including 270 base pairs and a second fragment 
including 50 base pairs. In contrast, individuals having 
the aforementioned mutation in CYP2C9 present a single 
20 fragment of 320 base pairs only. This is due. to a loss of 
the Avail site. The CYP2A6 gene variants can also be 
distinguished by the occurrence of specific restriction 
endonuclease sites. The CYP2A6vl variant, which is a 
T 488 -*A mutation in exon 3 can be identified by a variant- 
25 specific Xcml restriction site. The CYP2A6v2 variant, 
which contains a C 415 ->A mutation within exon 3 can be 
identified by a variant-specific Ddel restriction site. 
The wild-type CYP2A6 gene does not contain either an Xcml 
or Ddel site. The results of such restriction 
30 endonuclease digestions are illustrated in Figure 9. 

It may be necessary to amplify the DNA prior to 
digestion. Such may be the case when the DNA of interest 
is present in minute quantities in a sample. In such 
circumstances, amplification of DNA to be tested is 
35 undertaken before digesting the DNA as described above. 
This provides for a greater quantity of materials. 
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Amplification is performed using any conventional 
technique, such as by a PCR reaction. Many other 
techniques for amplification can be used in producing 
sufficient DNA for detections. Such amplification 
techniques are well-known to the skilled artisan and 
5 include, but are not limited to polymerase chain reaction 
(PCR), PCR in situ, ligase amplification reaction (LAR) , 
ligase hybridization, QB bacteriophage replicase, 
transcription-based amplification system (TAS) , genomic 
amplification with transcript sequencing (GAWTS) and 

10 nucleic acid sequence-based amplification (NASBA) . A 

general review of these methods is available in Landegren, 
et al., Science 242:229-237 (1988) and Lewis, R. , Genetic 
Engineering News 10:1, 54-55 (1990), which is incorporated 
herein by reference. 

*5 one embodiment of the present invention uses 

oligonucleotide primers in an amplification and detection 
assay. A basic description of nucleic acid amplification 
is described in Mullis, U.S. Patent No. 4,683,202, which 
is incorporated herein by reference. The amplification 

20 reaction uses a template nucleic acid contained in a 

sample, two primer sequences and inducing agents. The 
extension product of one primer when hybridized to the 
second primer becomes a template for the production of a 
complementary extension product and vice versa, and the 

25 process is repeated as often as is necessary to produce a 
detectable amount of the sequence. 

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 

30 for this purpose include, for example, E.coli DNA 

polymerase I, thermostable Tag DNA polymerase, Klenow 
fragment of E.coli DNA polymerase I, T4 DNA polymerase, 
other available DNA polymerases, reverse transcriptase and 
other enzymes which will facilitate combination of the 

35 nucleotides in the proper manner to form amplification 
products . 
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A sample being screened for the presence or 
absence of a mutation in CYP2A6 and/or CYP2C9 genes can be 
tested with the instant invention. The nucleic acid 
material can be in purified or nonpurified form, provided 
the sample contains the CYP2A6 and/or CYP2C9 genes. The 

5 sample may be derived from any tissue or bodily fluid, 
wherein the patient's DNA can be found. A clinically 
practical type of sample is a blood specimen which 
contains patient DNA and can conveniently be genotyped in 
the bioassay of the present invention. 

10 T k e "primers", as the term is used in the 

present invention refers to an oligonucleotide, whether 
occurring naturally as in a purified restriction digest or 
produced synthetically, which is capable of acting as a 
point of initiation of synthesis when placed under 

15 conditions wherein synthesis of a primer extension product 
which is complementary to a nucleic acid strand is 
induced, i.e. in the presence of nucleotides and an 
inducing agent such as DNA polymerase and at a suitable 
temperature and pH. The primers are preferably single 

20 stranded for maximum efficiency in amplification, but may 
alternatively be double stranded. If double stranded, the 
primer is first treated to separate its strands before 
being used to prepare amplification products. Preferably, 
the primers are oligodeoxy ribonucleotides. The primers 

25 must be sufficiently long to prime the synthesis of 

extension products in the presence of the inducing agent. 
The exact lengths of the primers will depend on many 
factors, including temperature, source of primer and use 
of the method. For diagnostic methods, the primers 

30 typically contain at least 10 or more nucleotides. The 

oligonucleotide primers may be prepared using any suitable 
method, such as, for example, the phosphotriester and 
phosphodiester methods (Narang, S.A., et al., Meth. 
Enzymol. 68:90 (1979); Brown E.L. , et al., Meth. Enzymol. , 

35 68:109 (1979)) or automated embodiments thereof. In one 

such automated embodiment diethylphosphoramidites are used 
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as starting materials and may be synthesized as described 
by Beaucage et al.. Tetrahedron Letters 22:1859-1962 
(1981). One method for synthesizing oligonucleotides on a 
modified solid support is described in U.S. Pat. No. 
4,458,066. It is also possible to use a primer which has 

5 been isolated from a biological source (such as a 
restriction endonuclease digest) . 

In a genotyping bioassay of the present 
invention, one embodiment comprises a gene-specific 
amplification reaction, an exon-specif ic amplification 

*0 reaction and a restriction endonuclease reaction. In such 
a reaction a suitable polynucleotide polymerase is used in 
the amplification reaction, many of which have already 
been described in the art. In addition, any appropriate 
restriction endonuclease which is designed to digest the 

15 DNA and so provide information concerning genotype may be 
used. 

It may further be necessary to provide a label 
on the nucleic acid for detection. The nucleic acid can 
be DNA or RNA and made detectable by any of the many 

20 labeling techniques readily available and known to the 
skilled artisan. Such methods include, but are not 
limited to, radio-labelling, digoxygenin-labeling, and 
biotin-labeling. A well-known method of labeling DNA is 
32 P using DNA polymerase, Klenow enzyme or polynucleotide 

25 kinase. In addition, there are known non-radioactive 

techniques for signal amplification including methods for 
attaching chemical moieties to pyrimidine and purine rings 
(Dale, R.N.K. et al. 1973 Proc. Natl. Acad. Sci. USA, 
70:2238-2242; Heck, R.F. 1968 S. Am. Chem. Soc, 90:5518- 

30 5523) , methods which allow detection by chemiluminescence 
(Barton, S.K. et al. 1992 .7. Am. Chem. Soc. , 114:8736- 
8740) and methods utilizing biotinylated nucleic acid 
probes (Johnson, T.K. et al. 1983 Anal. Biochem. , 133:125- 
131; Erickson, P.F. et al. 1982 J. of Immunology Methods, 

35 51:241-249; Matthaei, F.S. et al 1986 Anal. Biochem., 
157:123-128) and methods which allow detection by 
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fluorescence using commercially available products. Non- 
radioactive labelling kits are also commercially 
available. Such a label can readily be incorporated into 
the nucleic acid during an amplification step* In the 
absence of an amplification step, a target nucleic acid 
can readily be chemically or enzymatically modified to 
carry a label. Additionally, it may be preferable to 
provide a labeled primer which may serve to incorporate a 
label into the nucleic acid target. Probes, as may be 
used in an embodiment of the invention may also be 
chemically or enzymatically labeled as described above. 

In a preferred embodiment of the invention said 
DNA primer hybridizes to an intron adjacent said position 
of said mutation. Preferably said DNA is a primer with 
the 3 '-end specific for the gene of interest. Preferably 
further still said DNA is single stranded. Preferably 
further still, in so far as the CYP2C9 mutation is 
concerned, said primers are as follows: 

HF18: position 8 of intron 2 onwards of genomic 
sequence in forward orientation comprises 
5' TGCAAGTGCCTGTTTCAGCA 3' 

HF2R: position 505 onwards of cDNA sequence in 
reverse orientation comprises 
5' AGCCTTGGTTTTTCTCAACTC 3'. 

It is of note that both these primers are 
designed to be specific for CYP2C9 and so do not amplify 
related genes such as CYP2C8, which notably also has an 
Arginine U4 present. 

Preferably, in so far as CYP2A6 is concerned, 
three primers J51, J61 and B are used in two parallel 
allele-specific PCR reactions. These primers are as 
follows: 

35 J51 comprises 5' GGCTTCCTCATCGACGCACT 3' 

(forward strand from position 479 of cDNA 
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sequence described as hIIA3 (Yamano, et al. 1990 
Biochem 29:1322-29) ) . 

J61 comprises 5' GGCTTCCTCATCGACGCACA 3' 
(forward strand from position 479 of cDNA 
sequence described as hIIA3v (Yamano , et al. 
1990 Biochem 29: 1322-29) ) • 
Both J51 and J61 contain a substitution at 
position 18 of A for C to give improved 
specificity as suggested by Newton et al (1989 
Nuc. Acids Res. 17: 2503-2516) • 
Primer B comprises 5' AATTCCAGGAGGCAGGGCCT 3' 
(reverse orientation from position 125 of intron 
3 of CYP2A6 (onwards) . Designed so that only 
CYP2A6 and not CYP2A7 or CYP2A12 are amplified. 

One method of genotyping CYP2A6 provides an 
allele-specific amplification reaction method is used. In 
this instance, DNA which is adapted to specifically 
hybridize to the wild-type or the mutant type of the gene 
is incubated with test DNA under reaction conditions and 
20 the resultant products are analyzed by electrophoresis and 
then visualized by staining with ethidiura bromide. 
Individuals who are homozygous for the wild-type allele 
produce a reaction product with primer J51 only. 
Similarly, individuals who are homozygous for the mutation 
25 produce a reaction product with primer J61 only. Those 
individuals who are heterozygous produce a reaction 
product with both J51 and J61. 

Alternatively, another method for genotyping 
CYP2A6 is provided in a specific amplification bioassay, 
30 which is achieved with primers F4 and R4 as follows: 

The F4 primer (forward) comprises 

5 ' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCAGGATT 
CATGGTGGGGCA 3', wherein a preferred fragment 
35 thereof further comprises 

5' CCTCCCTTGCTGGCTGTGTCCCAAGCTAGGC 3'. 
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The R4 primer (reverse) comprises 

5 ' GCCACCACGCCCCTTCCTTTCCGCC^TCCTGCCCCCAGTCTTAGC 
TGCGCCCCTCTC 3', wherein a preferred fragment 
thereof further comprises 
5 ' CGCCCCTTCCTTTCCGCCATCCTGCCCCCAG 3 ' . 

5 

This method of CYP2A6 genotyping involves a 
first amplification reaction with F4 and R4 primers, which 
generates a DNA fragment approximately 7.8 kb in size. 
This amplification step is facilitated by polymerases 
10 which are capable of transcribing long stretches of DNA. 

To distinguish the CYP26Avl and CYP26Av2 variant alleles, 
an exon-specific amplification step is carried out using 
the 7.8 Kb DNA fragment as template DNA. This may be 
accomplished using the following primer pair: 

15 

The E3F primer (forward) comprises 
5 ' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCGCCAAG 
CAGCTCCTG 3', wherein a preferred fragment 
thereof further comprises 
20 5/ GCGTGGTATTCAGCAACGGG 3'. 

The E3R primer (reverse) comprises 

5 ' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCCCCCGC 

ACTCGGGATGCG 3', wherein a preferred fragment 
thereof further comprises 
25 5 / TCGTCCTGGGTGTTTTCCTTC 3 9 . 

Using these primers in a second amplification 
reaction step a segment of CYP2A6 exon 3 is specifically 
amplified. The method further comprises use of the 

30 restriction endonuclease Xcml to detect the CYP2A6vl 
mutation and Ddel to detect the CYP2A6V2 mutation. 

According to a yet further aspect of the 
invention there is provided a kit for performing the afore 
described methods which kit includes at least a portion of 

35 DNA in accordance with the invention and preferably at 

least one control sample of DNA containing the mutation or 
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mutations of interest and ideally also a wild-type sample 
of DNA so that suitable comparisons can be made. 

It is of note that although the method is 
described with reference to the above methods , any 
suitable method using the genetic material of the 
5 invention may be used to identify the mutations described 
herein. 

The CYP2C9 assay has been used in a study of 
warfarin dose requirement in 94 patiertts undergoing 
anticoagulant treatment and the results obtained are 

10 summarized in Figure 5. 58 patients (61.7%) were 

homozygous for the wild-type (Arg U4 ) allele and were found 
to require a median weekly maintenance dose of 31.5 mg of 
warfarin. 36 patients (38.6%) were heterozygous and 
required a median weekly maintenance dose of 24.5 mg. The 

15 doses required by the two groups were significantly 

different (Mann-Whitney U-test, p - 0.016). No subjects 
in the group were homozygous for the mutant allele but 
based on allele frequencies and the Hardy Weinberg 
equilibrium, the predicted frequency of homozygous mutant 

20 subjects is 3.7.%. 

Comparison of the weekly maintenance dose of 
warfarin in the R144C heterozygotes (n = 36) and 
homozygous wild-type (n «= 58) reveals that the 
heterozygotes required a significantly lower dose (range 

25 of 10.5 - 80. mg). Moreover, of the patients requiring the 
lowest doses to maintain an anticoagulation target (INR 
2.0-4.0), in the range 5-15 mg per week, 9 out of 10 were 
heterozygous. At the other extreme of weekly doses >55 
mg, 5 out of 6 patients were homozygous wild-type for 

30 CYP2C9 . The significantly lower (20%) warfarin dose 

requirement of the patients with one variant R144C allele 
is consistent with the kinetic properties of the R144C 
protein with respect to (S) -warfarin hydroxylation and 
presumed in vivo metabolic clearance (Rettie et al. 1994 

35 Pharmocogen* , 4:39-42). 

The CYP2A6 genotyping assay has been used in 
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studies on coumarin metabolism. Coumarin 7 -hydroxylase 
activity is a convenient marker activity to identify the 
presence of CYP2A6 in a particular sample. There is 
considerable variation in the ability of individuals to 
7-hydroxylate this compound which is a reaction specific 
5 for CYP2A6. A subject deficient in coumarin 

7-hydroxylation has been identified. This subject is 
homozygous for the mutant CYP2A6vl allele confirming the 
previous in vitro findings that substitution of LeuI50 by 
His results in loss of coumarin 7 -hydroxylase activity. 

10 As shown in Fig. 6, CYP2A6 genotyping and phenotyping with 
coumarin has been performed on other members of the 
proband's family and impaired coumarin 7-hydroxylation has 
been observed in heterozygotes for the CYP2A6vl mutation. 

The genotyping assays described herein resulted 

15 from a two step amplification reaction wherein first 
amplification reaction amplifies a 7.8 Kb fragment 
containing the CYP2A6 gene (Fig. 9A) and a second 
amplification reaction amplifies an exon-specif ic fragment 
of CYP2A6. The amplification product was digested with 

20 restriction endonucleases producing different patterns for 
the various CYP2A6 alleles. Representative results 
obtained for several human subjects for the detection of 
the CYP2A6vl (Xcml digestion) and CYP2A6V2 (Ddel 
digestion) are shown in Figure 9 panel B. A schematic 

25 depiction of this genotyping assay is shown in Figure 9, 
panel C. Of 155 human genomic DNA samples analyzed 21 
heterozygous (+/-) and 6 homozygous (-/-) subjects were 
detected for the CYP2A6vl allele, whereas 17 heterozygous 
(+/-) an d no homozygous were identified for the CYP2A6v2 

30 allele variant. Additionally, 7 homozygous for both 
CPYP2A6vl and CYP2A6v2 alleles were found. 

Allelic frequencies were calculated for either 
allele in several ethic groups and analyzed as shown in 
Table l. CYP2A6V1 frequency is almost identical between 

35 Caucasian and Japanese, and it is only twice the frequency 
in Taiwanese samples. Significantly, this allele is 
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completely absent in the African-American population 
within the samples studied. The Japanese population has a 
remarkable higher frequency for the CYP2A6v2 allele (28%) 
as compared to the Caucasian (2%) , Taiwanese (6%) or 
African-American (2.5%) (ethnic groups) . 



Table 1 ; Allelic frequency for the CYP2A6 gene in 
different ethnic groups. 

Allelic Frequencies (%) 



Ethnic Group 


CYP2A6 


CYP2A6V1 


CYP2A6V2 


N 


Caucasian 


75 


23 


2 


52 


Japanese 


52 


20 


28 


40 


Taiwanese 


83 


11 


6 


178 


African-American 


97.5 


0 


2.5 


40 



The following examples illustrate various 
aspects of the present invention and in no way are 
intended to limit the scope thereof. All books, articles, 
and patents referenced herein are incorporated herein, in 
toto, by reference. Other similar embodiments will be 
clear to the skilled artisan and are encompassed within 
the spirit and purview of the present invention. 



15 



20 



EXAMPLE 1 

Method for determining the genot ype CYP2C9 

Genotyping for the CYP2C9 polymorphism is 
carried out by amplification by PGR followed by digestion 
with the restriction endonuclease Avail. Amplifications 
are performed in 0.5 ml microcentrifuge tubes in a volume 
of 100 /xl containing 10 mM Tris-HCl, pH 8.8, 1.5 mM MgC12. 
50 mM KCl r 0.1% Triton X-100, 5% dimethylsulphoxide, 200 
/zM each of' dTTP, dATP, dCTP and dGTP, 250 }M of the 
primers HF18 and HF2R, 2.5 units Taq polymerase and 1 
human leukocyte genomic DNA. PCR conditions consist of 35 
cycles with a denaturation at 93 °C for 1 min. annealing at 
55 °C for 1.5 min and polymerization at 72 °C for I min. 20 
pi of the amplified DNA is incubated with 10 units Avail 
for 3h at 37 °C and then analyzed by electrophoresis on 
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1.8% agarose minigels in TBE (90 mM Tris-borate, 2 mM 
EDTA) buffer. The digestion products are visualized by 
ethidium bromide staining. DNA from individuals positive 
for the wild-type Arg,44 is digested to give fragments of 
270 bp and 50 bp whereas in individuals with the mutant 
Cys lu present, a band of 320 bp is seen due to loss of an 
Avail site (Figure 3) . 

EXAMPLE 3 

Genotyping for the CYP2C9 polymorphism was 

10 carried out by amplification by PCR followed by digestion 
with the restriction endonuclease Avail. 

One hundred patients were recruited from two 
anticoagulation clinics in the Newcastle area over four 
study days. Body weight and height were measured, the 

15 basal metabolic index ("BMI") calculated for each patient 
and details of age, sex, drug history, current and 
previous International Normalized Ratio ("INR") 
determinations, indications for anticoagulation and other 
significant health problems were all recorded. DNA was 

20 isolated by a standard manual chloroform-phenol extraction 
procedure and 1/xg was subjected to PCR analysis. As shown 
in Figure 10 the C-*T substitution, which converts Arg-*144 
to Cys, resides in exon 3 of the CYP2C9 gene and results 
in the loss of an Avail restriction site 

25 ( . . . GAGGACCGTGTTCAA. . . ) in the R144C allele 

( . . • GAGGACTGTGTTCAA. . . ) . This provided the basis of the 
amplification strategy. A CYP2C9 specific intron forward 
primer (HF18, TGCAAGTGCCTGTTTCAGCA, Figure 10) and a 
CYP2C9 exon 3 3 '-end reverse primer (HF2R, 

30 AGCCTTGGTTTTTCTCAACTC , Figure 10) were used at a 
concentration of 250/xM each. Amplifications were 
performed in a volume of 100 fil containing 20 mM Tris HC1 
(pH 8.3), 1.5 mM MgCl 2 , 25 mM KC1, 0.05% (w/v) Tween 20, 
10 pq gelatin/ml, 2% (w/v) DMSO, 200 each of dATP, 

35 dCTP, dGTP and dTTP and 2.5 units of Taq DNA polymerase 

(Perkin-Elmer) . Reactions were carried out for 35 cycles 
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10 



15 



at an annealing temperature of 55*0 for 90 sec r a 
polymerase temperature of 72 *C for 1 min, and a heat 
denaturing temperature of 93 'C for 1 min, using a Perkin- 
Elmer Cetus DNA thermal cycler. The PCR products digested 
with Avail and sized using NuSieve agarose gels (3% 
NuSieve, 0.75% agarose). Presence of the CYP2C9 wild-type 
and R144C alleles were detected as fragments of 50 + 270 
bp and 320 bp respectively (see Figures 3) . The PCR 
product synthesized from human genomic DNA with the 
primers HF18/HF2R was directly sequenced on an ABI 37 3 A 
automatic sequencer. Briefly, the PCR product was first 
purified by using the Wizard DNA clean-up system (Promega 
Co., Madison, WI) . The purified template was then 
subjected to dideoxy terminator cycle-sequencing with the 
primers HF18 and HF2R. The primer-extended products were 
purified and sequenced following the manufacturer's 
procedure. Sequence analysis was done by using the 
MacVector software program (Eastman-Kodak Co., Rochester, 
NY) . 

DNA was obtained from 94 patients. Of these 58 

20 (62%) were homozygous for the wild-type CYP2C9 gene and 36 
(38%) were heterozygous for the R144C allele. No R144C 
homozygotes were found. The frequency of the wild-type 
(Arg-144) and R144C (Cys-144) alleles in the study 
population is thus 0.808 and 0.192 respectively. An 

25 expectation of 3.7% R144C homozygotes can be anticipated 
from the Hardy-Weinberg equilibrium, but the 95% 
confidence interval in this estimation of 0.8-8.4% and 
thus the finding of zero homozygotes in 94 patients is not 
significantly different from expectation. The specificity 

30 of the PCR reaction with respect to the CYP2C9 gene was 
confirmed by sequencing. The alignment of the sequence 
obtained from the PCR product with that corresponding to 
the CYP2C9 gene showed a 100% degree of homology. 
Interestingly, a heterozygous pattern was obtained for the 

35 R144C allelic variant, confirming the high frequency of 
this allele within the normal population. No sequence 
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deriving from CYP2C9, CYP2C18 or CYP2C19 was found 
confirming the specificity of the assay for CYP2C9. 

EXAMPLE 3 

Method for determining the genot ype CYP2A6 

Genotyping for the CYP2A6 polymorphism is 
carried out by allele-specif ic PCR using two parallel PCR 
reactions, one specific for the wild-type allele, one for 
the mutant allele. Amplifications are performed in 0.5 ml 
microcentrifuge tubes in a volume of 45 fil containing 10 
mM Tris-HCl, pH 8.8, 1.5 mM MgCl 2 , 50 mM KCl, 0.1 % Triton 
X-100, 5 % dimethylsulfoxide, 200 /xM each of dTTP, dATP, 
dCTP and dGTP, 250 /*M of the primers B and either J51 or 
J61, 1.25 units Taq polymerase and 1 /xg human leukocyte 
genomic DNA. PCR conditions consist of 40 cycles with a 
denaturation at 93 °C for 1 min. , annealing at 57 °C for 2 
min and polymerization at 70 °C for 2 min. The products 
are analyzed by electrophoresis on 1% agarose minigels in 
TBE buffer and DNA is visualized by staining with ethidium 
bromide. As shown in Figure 4, there are three possible 
results: the individual may be homozygous for the 
wild-type allele and give a DNA product only for the PCR 
reaction with primer J51, the individual may be 
heterozygous with one wild-type and one mutant allele and 
give DNA products with both primers J51 and J61 or the 
individual may be homozygous for the mutation and give a 
DNA product only with the J61 primer. 



EXAMPLE 4 

Alternative Method for Determining the Genot ype CYP2A6 
30 Yor use of F4 and R4 primers, each reaction 

mixture contained 600 ng human genomic DNA, 0.2 /iM of each 
primer, 200 jiM dNTP's, 0.8 mM magnesium acetate and 2 
units of rTth I DNA polymerase. Hot start was as 
indicated by the manufacturer (Perkin Elmer) and the 
35 amplification reaction of 31 cycles of 93 W C, 1 min; 66°C, 
6 min 30 sec. Amplification products were analyzed in 
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0.7% agarose gels and the DNA visualized by staining with 
ethidium bromide. For the exon 3 specific amplification, 
the reaction which uses, the primers E3F and E3R consist 
of 5fil of the 7.8 Kb PCR reaction, 0.5 /*M of each primer, 
200 fM dNTP's, 1.5 mM MgCl 2 and 2.5 units of Taq DNA 

5 polymerase. The amplification reaction consisted of 94 *C 
for 3 minutes followed by 31 cycles of 94 *C, 1 minute; 
60'C, 1 minute and 72'C, 1 minute. 

. Amplification products were then digested 
without purification with restriction endonucleases which 

10 detect the CYP2A6 wild type (no digestion) , CYP2A6vl 

(Xcml) and CYP2A6v2 (Ddel) . DNA was visualized by use of 
ethidium bromide after electrophoresis in 1% agarose, 3% 
NuSieve agarose. 

It is of note that CYP2C9 genotyping can be 

15 performed using an allele-specif ic assay similar to that 
used above for CYP2A6. 
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CLAIMS 

1. A CYP2A6V2 DNA having a coding sequence 
shown in Figure 11. 

2. The DNA of claim 1 having a genomic 
sequence as shown in Figure 12. 

3. A CYP2A13 DNA having a coding sequence 
shown in Figure 13. 

4. The DNA of claim 3 having a genomic 
sequence shown in Figure 14. 

5. A nucleic acid primer sequence comprising 
at least ten (10) contiguous nucleotide bases selected 
from the sequence showing in Figure 12. 

6. A nucleic acid primer sequence comprising 
at least ten (10) contiguous nucleotide bases selected 
from the sequence shown in Figure 14. 



7. A nucleic acid primer sequence selected 

from the group consisting of: 

A. 5' GGCTTCCTCATCGACGCACT 3'; 

25 B. 5' GGCTTCCTCATCGACGCACA 3'; 

C. 5 ' AATTCCAGGAGGCAGGGCCT 3'; 

D. 5' TGCAAGTGCCTGTTTCAGCA 3'; 

E. 5' AGCCTTGGTTTTTCTCAACTC 3'; 

F. 5' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCA 



G. 5' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTC 
TTAGCTGCGCCCCTCTC 3'; 

H. 5' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCG 

CCAAGCAGCTCCTG 3'; 

I. 5' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCC 
CCCGCACTCGGGATGCG 3'; 
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or any nucleic acid sequence of at least 10 contiguous 
nucleotides selected from any one of A-I. 

8. A method of determining the presence or 
absence of an allelic variant in CYP2A6 or CYP2C9 DNA 

^ comprising: 

(a) amplifying an exon containing a variant 
sequence with in said DNA, producing an 
extension product; 

(b) treating extension products with at least 
10 one restriction endonuclease under 

conditions sufficient to produce digestion 
fragments ; 

(c) analyzing the digestion fragments, for a 
variant specific digestion fragment or lack 

15 thereof. 

9. The method of claim 8 wherein a CYP2C9 
variant DNA is being detected. 

20 io. The method of claim 9 wherein. the 

amplifying step is a polymerase chain reaction using 
primers comprising HF18 and HF2R. 

11. The method of claim 8 wherein step (a) is 
25 preceded by a gene-specific amplification reaction. 

12. The method of claim 11 wherein the gene- 
specific amplification is a polymerase chain reaction. 

30 13. The method of claim 12 wherein a CYP2A6 

variant is being detected. 

14. The method of claim 13 wherein a gene- 
specific amplification reaction uses primers comprising F4 
35 and R4 and the exon amplification reaction uses primers 
comprising E3F and E3R. 
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15. The method according to claim 10 wherein 
the extension products are treated with the restriction 
endonuclease Avail. 

16. The method according to claim 14 wherein 
the extension products are treated with at least one 
restriction endonuclease comprising Ddel and Xcml. 

17. A method of determining the presence or 
absence of an allelic variant in CYP2A6 or CYP2C9 DNA 
comprising: 

(a) contacting said DNA with a first primer 
encompassing a nucleotide variation 
specific to variant DNA and a second primer 
which is complementary to a region of said 

15 DNA such that upon hybridization and 

amplification, an extension product will be 
formed; 

(b) analyzing the extension products for 
allelic-variant specific extension 
products . 

18. The method of claim 17 wherein a CYP2A6 
variant DNA is being detected. 

25 19. The method of claim 18 wherein the 

amplifying step is a polymerase chain reaction wherein the 
first primer comprises J51 and J61 and the second primer 
comprises primer B. 



20 



30 



35 



20. A kit for determining the presence or 
absence of an allelic variant of CYP2A6 or CYP2C9 DNA 
comprising: at least one nucleic acid primer sequence 
capable of hybridizing to said DNA; the kit further 
containing instructions relating to the determination of 
the presence or absence of an allelic variant of CYP2A6 or 
CYP2C9 DNA. 
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21. The kit according to claim 20 further 
comprising amplification components and at least one 
restriction endonuclease . 

22. The kit of claim 20 wherein the CYP2A6 
allelic variant is being detected. 

23. The kit of claim 22 wherein the nucleic 
acid primers comprise F4, R4, E3F and E3R. 

10 24. The kit according to claim 20 wherein the 

CYP2C9 allelic variant is being detected. 

25. The kit according to claim 25 wherein the 
nucleic acid primers comprise HF18 and HF2R. 

15 

26. A process for providing a human with a 
therapeutic CYP2A6V2 or CYP2A13 DNA segment said human 
cells expressing in vivo in said human or therapeutically 
effective amount of said protein. 

20 

27. A pharmaceutical composition comprising an 
antisense nucleic acid derived from CYP2A6V2 DNA. 

28. A pharmaceutical composition comprising and 
25 antisense nucleic acid derived from CYP2A13. 



30 



35 
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2A6 intron 2 CCTCCACCCA GATCTCCCCA TATCTCACTA CCCCACCCTC CATC CTC 587 

2AB intron 2 CCTCCTCCCA GATCTCCCCA TATCTCACTT CCCCTCCCTC CATCTCTC7C 5 56 

Consensus CCTCCWCCCA GATCTCCCCA TATCTCACTW CCCCWCCCTC CATCTCTCTC 600 

2A6 intron 2 TGCCT C CATCAC—TC TCTTTCTC TCC CC — A 615 

2A8 intron 2 TTTCTCTCCC CACTACCTTC CCTTCCTCCA TGGAGTATCC CCGTATCCCT 646 

2A7 intron 2 ~— — 271 

Consensus TKYCTCTCCC CAYYACCTTC YCTTYCTCCA TGGAGTATCC CCGTATCCCT 650 

2A6 intron 2 C TGCCC C T GC GGACGCGATC CAATGG—AG TGTG GA G 650 

2A8 intron 2 CI G ITT C 1CJ GCATCTGTCT GTCTGGCCTT TCTGCTTCTC TTCTGATTCT 696 

2A7 intron 2 — 271 

Consensus CTGYYYCTSY GSAYSYGWYY SWMTGGCCWK TSTGCTTCTC TTCTGATTCK 700 

2A6 intron 2 CTAATGCCGT -GAA GCTATGTGCA TCTCTCTGTC TGGCCGTACC 693 

2A8 intron 2 CTTATTCTTT CTACCCGGAC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC 74 6 

Consensus CTWATKCYKT CTACCCGGAM KCTMTSTSYM TCTCTCTSTC TSKCYSTMYC 750 

2A6 intron 2 TGGGT AA TAACCTGATC GACT 714 

2A8 intron 2 TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTA TATATATATA 796 

Consensus TSKSTCTCWM TMWCYYKMTC KMYYTCTCTC TCTCTCTCTA TATATATATA 800 

2A8 intron 2 TATATATATA CACACACACA CACACACACA CACACACACA CACACACATA 846 

2A7 intron 2 ~— — _ . 27 1 

Consensus TATATATATA CACACACACA CACACACACA CACACACACA CACACACATA 850 

2A6 intron 2 — — 7l4 

2A8 intron 2 TATATTAGGG GGGGACTCCC TTTCTGCTCC ACCCTTGGGG AGCCCCTTGG 896 

2A7 intron 2 — 27 1 

Consensus TATATTAGGG GGGGACTCCC TTTCTGCTCC ACCCTTGGGG AGCCCCTTGG 900 

2A8 intron 2 AACTGGTCCG CTCTGCTACC ACCACCCCCT GACCTCTCTC CACCCCCGCG 94 6 

2A7 intron 2 - — — 27 1 

Consensus AACTGGTCCG CTCTGCTACC ACCACCCCCT GACCTCTCTC CACCCCCGCG 950 

2A6 intron 2 714 

2A8 intron 2 TTCACCTCCC CA g58 

2A7 intron 2 — 

Consensus TTCACCTCCC CA 962 

Intron 2 alignment 
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2A8 exon 3 
2A6 exon 3 
2A7 exon 3 

Consensus 



GCGTGG 
GCGTGG 
GCGTGG 


ccJtt 

3GTT 







cag:aac3Gg 

CAGSAACGGG 



GAGCGCGCCA AGCAGCT 
GAGCGCGCCA AGCAGC 



crcdr 



GGG GAGCGCGCCA AGCAGCT 



T 



gc gc: yqc: 
dc: 



50 
50 
50 



£3 50 



2A8 exon 3 
2A6 exon 3 
2A7 exon 3 

Consensus 



2A8 exon 3 
2A6 exon 3 
2A7 exon 3 

Consensus 



ATCGCCACCC 
ATCGCCACCC 
ATCGC CACCC 




CATCCAGGAG 
CATCCAGGAG 
CATCCAGGAG 

r'TTTMTfiM? 



3GGCGTGGGC AAGC^GGCA TCGAGGr- rZZ 
CGGEGTGGGC AAGCGBGGCA TCGAGG^ICC 
TT CGGCGTGGGC AAGCCtfGGCA TCGAGGrf^CG 



Codon 160 




TCCTCATCGA 
TCCTCATCGA 
TCCTCATCGA 




IOC 
100 
100 

100 



150 
150 
150 

150 



—•-Primer J51/61 
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2A8 Intron 3 
2A6 intron 3 
2A7 lncron 3 

Consensus 



GTGAGCAGGG GACCCCGAG? GCGGGGGC? G 
GTGAG7AAGG TTCCCCGAG7 GCGGGGGCJG 

GTGAGYARGG KWCCCCGAG7 GCGGGGGC; 



1 



ICGAGTG 

C 

c 



21 
44 
44 

50 



2A8 intron 3 
2A6 intron 3 
2A7 Intron 3 

Consensus 




gaacc — cgc 
:c#g gaacccccgc 

GAACCrCCGC 




GA73GG GACTAGGTGG 
GA7GGG GAC7AGGTGG 



6B 
94 
94 

100 



2A8 intron 3 
2A6 Intron 3 
2A7 intron 3 

Consensus 




CCGCACTTCC 
CCGCACTTCC 
CCGCACTTCC 




TCTGGCGC1- 



TCTGGCGC7 3 
TCTGGCGC13 



rrzn 



ATTEGGC 
ATT3GGC 
ATT3 GGC 



117 
144 
144 

150 



2A8 intron 3 
2A6 intron 3 
2A7 intron 3 

Consensus 



2A8 intron 3 
2A6 intron 3 
2A7 intron '3 

Consensus 



TCAACMGGC CCTGCCTCCT GGAATTCTGA CTCTCCTCAG ACCTC7GAGI 
TCAAOWGGC CCTGCCTCCT GGAATTCTGA CTCTCCTCAG ACCTCTGAG1 
TCAACJflGGC CCTGCCTCCT GGAATTCTGA CTCTCCTCAG ACCTCTGAGI 



uretere esse 



-Primer B 



TGACTCTCTC CCCAACCCCC CT1jCTCCC|3C 
TGACTCTCTC CCCAACCCCC T-ICTCCCEA 
TGACTCTCTC CCCAACCCCC rCCCTCCOTC 




167 
194 
194 

200 



207 
233 
235 

241 



Intron 3 alignment 
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<-300 bp 
<-200 bp 
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<-300 bp 
<-200 bp 
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12 34 5 678 9 

—OH] M Q HHHl CYP2AS wild type 

1234 5 6789 

—0-0 — [HI — 0 — \HHH\ — cyp2asvi 

L— H 

12 34 5 6789 

—0-0 — H — 0 — hO-H — w«« 



FIG. 8B 




F4 



12 3 4 



6 7 8 9 



=*Q\} H 0— 4^-{H}nr- CYP2A6 




7.8 kb PCR 
product 



Dde I Xcm I 



I I 



E3F 



Exon 3 



E3R 



201 bp 



141 bp 



60 bp 
► 



59 bp 
«« H-«- 



142 bp 



CYP2A6 wild type 

CYP2A6v1 

CYP2A6V2 
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CYP2A6v2 cDNA. 

5' TCTAOC^(X ATGCira50CnCAGGGA^ 
QACItnGAItKnCTIGAlGICIGIT^ 

ACAC^GAGCAGAIGTACAACIXXCTCATG^ 

ATGCnTICAGGGAGGCTCTC^ 

GAGCAAGGCACXTia3ACTCGGrciTCAA^ 

AQGGGGAGOGCtKrAAGCAGCTOaiGOOaTO 

TOGGGGrGOGCAAOOGAOGC^roGAGGAGOGCATOCAGGAGGAGIC OO^ 

CICATOQAGGGCAT00GQAGCAOQCA(3jGCX3O^ 

CTOAGGOGCACAGICTDCAAiGflCAICA^ 1GGGGAC0GCTT 

TGACTATAAGGACAAAGAGTKXTIXTICACTGTO 

CTItX^GTICAOnCAACXna^O^^ 

TGATGAAACACXTOCCAGGACCACAGCAACAGGCCm 

GCIGGAGGACTICATAGCXIAAGAAGGTGGAGCACAAGCAG 

TCOCAATTtXXX^OGGGACnTCATroACltJL'1'1 1 C R1ATOCGCATGCAGGAGG 

AGGAGAAGAACCCCAACAOjGAGri^^ 

ACGTIGAACCTCITCATIGCAGGCAOC^ 

TOGLTlCriUClCO C ATGAAGCA(XC^^ 

GAGATTGACAGAGIQATOGGCAAGAAOOGGCAGOGCAAGT^ 

GCCAAjGATCCCXTACATGGAGGCAGTOATCXZ^ 

GAOGTGATCXIXXATGACjrTTGGCXXGCAGA 

CXXI A IlTlUriaJl G OCrAAGGGCATAGAAGfl^^ 

CTGAGAGACXnX^CXJTICnClC^ 

CTGGGTGAGAAGGGGCAGTrTAAGAAGCGTGATGC^ 

TCAGAAAGCX3GAACItjnriTCX5GAGAAGGCCKKj^ 

CTICITC^CX^COGICAT 

AGGACATTGACXTTCJIXXXXEAAACAOT 

rTAC ACT ATtlAI A 1 1 1 1 1 1 'll'/YYTGCTCl AGOGAGGGCIX T rGC^ 

GTGGGQGGGGQCAGGGAAAGGGCAGGGCCA AQA(Xr^^ 

figAGCTAAGAC TTGGGGGCAGGATGGGGGAAAGGAAGGGG 

Ar^TTAAGAGA 'AnAAACAOAAr^Ymt!AGTIt^anTGATA^ 

rTAr^Tr^TtaAGAGGAAGG AAAa^ACATrATGCTATSAAGAGTAGT 

AATAATAGCAGCTCTTATTTCCTGA3' 
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, 1AGTTCCCCT GARATATGGC TCTG BTCWC CTCCCCTTGC CAATGAAGAA G^MCASTG 
1 AAGTTCCCCT BA*R««w*. j~ ICAC KTCAGGTTC CAATGAGGAT TCTGGGCATC 
61 GAGGTTCTAT GGCAGCCATC CTOGCCTCAC GCACCCAGTG CTGGGCTGCT 

121 S^SSS'SSS ?SSS1 AAACTCCACA 

181 GGGCTTTCTG GG AGAACGCC GCTGGWTTG CCATATGCCT GGAATCCCCC 

241 CCCACAGCCC TGGGTCTTK £S£agI ££*AAATG CACAGCCACA 

301 TTCCTGAGAC COTOACCCT GCATCCTCCA P™"*^""" r^eeeCTCOA ACCCCCAGAT 
36l CTTTGTCWA CCCTAATAAA MXCMKg cluSScC 
421 CCGCACAACT TO=GGTGCA TTCTCACTCT CAGUXOAA SctTOOCC CTCCTAAATC 
481 CCTATCCAAA TATTCCAAAC JCCTCMTTC *JCAGCJTAT ££gSgC CCCCTCTCTG 
541 CACAGCCCWi COCCACCCCT CCTBAAOTAC Cggggg GTGTCCCAAG CTAGGCAGGA 

SSSSS SScIS S» AACCACCCCA GCCQTC&CCA TCTATCATCC 

!^ StogcS cagggatgct tciggtggcc ttgctggcct gcctoactct 

S 5S£ SS A. SaS 

901 CACCCCAITG pwlpwircCAAG ACAGGGAGAT GGG5GTCTCG GGGTGGGGGC TGCCTAGTTO 
961 ^S^SSS^^ AATGGAGm 
1021 GCTG fS! ^S£S SgatS TGGGATGTCC AGCTCCCTGA CTGTCAGAAC 
10 81 TGGAGTTTCA GCATCAGAAA GACAGG^CT cgccCCATTC AGAGTGGAGG 

1141 CTGGGTGCGA AGCATCCCAG CACATGACAT rieJUSCQCTA TGGCCCCGTG 

Sol GTTCTCCCTC TAACCACTCC C^OCCAOgC TGCCCTCAGG 
1261 TTCACCATTC ACTTGGGGCC CCGGCGCGTC «£^TCT ^»CA^ ^ ;crrcCAC 
1321 GAGGCTCTGG TGGACCACGC <reccciuiGAG GGGGAAGGX© GGCAGGIGGA CACGAAGOTC 
" " TG °!Sg SS SSSJS? GAGGATAAGG GAGAGTCCa 

™ SSS WGGGGCCTC TCCATGTGTA TCCCTCACCT 

1501 AGTCTGGTCT TO*tk«£a TCCCTCTCTC TGCCCCACCT CCTTATTCTC 

15 Ji 25£££a gSSSJ SSSSS SSaSct aaggacatcc tgggtttctg 

1621 gSgg GTCTACA3GA GTCTTTGAGC CCCTOTAGC TTCTGGGCIT 

1681 T ™ < ^ SC ? SSS? CGGATCCCTT TCTCAATTCT TCCTCTGTCT TAGGATCCCA 
1741 CTCTGGGTTT CTCATCTCTC rrATCTCCTG CTAACAGTCT CTCWCCTTC 

1801 GGGTTATTCC ^TICCACA £££££ SSgSS C^AGAATC 

1B61 CAGACCCTCT CTGTTTCTA.T CT^ATATTA „Svr£cAT ATCTCACTAC CCCACCCTCC 
1921 TCACACCAAG AGAGGATGTC C^ACCCAG *£TCOX*T ££££££ GATCCAATCG 
1981 ATCCTCTGCC TCCATCACTC JCTTXCTCTC £££££ SSSS CCTGGGTAAT 
2041 AGTGTGGAGC TAATGTCCTG JAGCTATGTO ££££££ cccCAAGCAG CTCCTGCGCT 
2101 AACCTGATCG ACPAGGCGTG ACGCATCGAG GAGCGCATCC 

2161 T1CCCATCGC CACCCTGAGG e^^GGGG «AreGTGAG CAGGGGACCC 

2221 AGCACGAGTC GGGCTTCCTC ^CGAGGCCA ScScGCG CGWCTGCCT 

s=SSSs== 
as sssss |ss s ™ -ss 

2641 CGTCAACCTC CACGGGGCAG GTAATGGTTC ^SS: CCCCCGGACA 

2701 CCGGCAAATT GTO CCCCTAC CGG^GAAGG GGGCCCCAAA SSc^GGG 
2761 GTGTCCCCTC AAAATCAGTC ^gS gISaACaS TGCTCCCCAA 

2821 TTGGTTGTCC JSSaSS CAG^AGCT CTCICACCCT GGGCACGTGT 

2881 AACAGAGCCT «TGGCA^GA ^^CCCT ctaccCAGGT crTCTTGAAT 

294i tcccatcccc *£™cc** ^SSS aotSSg TAAACTTTAG AGATTAGWC 

3001 ATTTTAACAC CCGGAAACCC TGGGXACwi* *™I .TGCCTTTAA CTCAOTTCCI 

3061 CTATCOGGCC CCTCTOAAAT *CC£AOCAC OGGAGAO^G JTGOCKTAA crotcCTrcc 

.3121 TCCTTGCTAT SSSSJ TCT CTTOGCT GATGAAACAC 

33S SS SggS ACGCGGGGA GA^CAAAGC 
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3481 AACCCTCATC ATAATAATCC TCftCAATTGG CTGGGTGCOS *«g£AC£ CCTGT^ 
354! CGCAC^O GSAGG^S GCAGGWGAT JSSS SS^TGCC 

3601 GGCCAACAIG ^AAACD* SS^S AoSSSgC ATTCCACTCC AGTCTGGGTG 
3661 GOGAAGGGGC GCAGAGGTTG CAAIGAGCCA agTGAGCCTG 

3901 WJ^AAACAA Sag^SSgG G«»ACGCCA GGAGTCAGGG CAACCOCTGG 

J961 GIAACGAAGG TG ACAAGQAA oAGW^W CCACCCTCCG qTCTtGCCC C 

4021 TTCC S^ SgSg^ SgSS SSSSS SIS TCTTGGGGXC 
4081 AAAGAGAGGT CflAGGGTGCT gaGTGGAAGA TCCATGGGGT GAACCCCTAG 

4141 CCCTCTTGAC CCCCATTGGT CTGAAOCTAA qcScCTCTC CCTTCAGGAG 

4101 ATGGTGCCCT GAGGTCAACC "mBOGUB SScTGA tSSScAC GTTOttCCTC 
4261 GAGAAGAACC CCAACACGGA ^SSS -SSotCTT ACTGCTCATG 

4321 TTCATTGaG G^CGAGAC «TCA^ACC £0™^ AGTGQAGGGC CCCAGACCCT 
4381 AAGCACCOVG 30010 0*03 ™KCCTW£ cOUXTCTCC CAGAICCCGG 6ACCCTGAGA 
4**1 CAAAWCGC CAGGGCfiACA TTCAGCTGGT AGGCAlScC TGAGTCTCAT 

4501 CGTGACTICC ^TCCAGAGA <j£*™££ TCATTGGTCA C?TCACTTCTG TCCCAAGCCC 

4561 SSSS? cXkccctaa GTTCCTCCCT GTGCCTCCCC 

4621 ACTGASTGCC CACTGCCCGT TCCAt "f™^ «rv^r*eOL aCAATGCGAA TGCCTGATGT 
4681 TGXGATTCTG TACTTCGGTC TCATAGCCGG AGGCATWCA TCCACCCCAT 

4741 CTtfW-XO TT ATGAATCCTC ^^.JSgaoSt AGaScCTAA ACACATTCCC 

4801 mroccTATC cggactatca tttcctgcic tgatcggcaa gaaccggcag 

4861 CCTCCTCCCC CAGCCAAGGT OCATGAGGAG S^S^A CGAGATCCAA 

4921 CCSAAGTTTG ******** £££££ 5SaSaGGA CACCAAGTTP 

4981 AGATTTGGAG ACOTGATCCC CATGK5TTTO reg. ^^pcRc qggGACTCCA 

5041 cgggatttct tcctccctaa ««gcta« ^Sgaac ctSctagac CCTGTCCCAC 

5101 GCCCMCTCT GTGTCCCCAG CATCCCACg StTcScC TTTCCACTTA 

5161 TCCCTCAATC AGTCAAAAAA GACTTCCCCA ACCACCACAT CCGTT^ juyeaofflHT 
5221 GACACTCCTG AGTCCTGCAT CTCTCCAGAC J^CCTTT TGTCATAGGG 

5281 CCCAAACTTC CTATCTXAAG JAACAG^GC mSaOXCA TGICTCCCAA 

S341 ACACAAATCT CAGCTCCCCC AAACTCCTCC ^M***"*^ »^ QAOGTCCCCA 

5*01 ACTTCCXGTT ^^^^^^^ CCCTCACCGG GGCACCCTAG TTCCCCCTCC AGCCCCTGTG 
5461 ATTCGCATGC CTOC^CTTC COTCACCGG TKCTCC ICC CTCCCAGGGC 

5521 TACTCTCAAC AATCCCCCAA "WSOTT tcAGGTTCTT CTCCAACCCC 

5581 ATAGAAGTGT ^TA^ SSSS SSaSc AG^SaA GCCTOATGCT 
5641 CGG GACTTCA ATCCCCAGCA ™CT5GGT g*gffgg% JcACCCKAC TACTCACACC 
5701 TTTGTGCCCT TCTKATCAG £S£S£ AGCCXAGTAT TTCCCCAGCT 

5761 AGCAGGGGCC TCCCTTACCC AGTTCCCCTC TGATACTCCC TTAACTACCA 

5821 TGGCAAGTTC CTGTTA^ AaSSS ^TACCCCT TTCAGAGGCG 

5881 AGCACCCAGT GATTTATTTC CCTAGGSTCA CACAGGAGAT 

5941 GGGGAAAACC AAAGGCCA^ C* 0 **^^ cacaqcaGGT CATATTXGGG AGTTCTTATC 
6001 TCTTCAGCAT C^TAAAAAG ^GATGA^ GGCATCGATC AACCCCATCT 

6061 TGGGGGAAGO GGGKKTT^. ACCTCCCATT G ™=^^ gGAGGGTCAA GAGGCTCCCT 
6121 TTTGGTCATC TTWGGGTCA CTCAAGGAAA SSaGCC GCAGCTCGAG 

6241 GTCGGTACTG GGGCGAGGCT ACoa ., ltt C c e AGAATGSAGC TCTTTCTCTT 

.6301 CICCTCAGGA AAG^GAACT JTTTCGG^ CAGtCACCTA AGGACATTGA 

6361 CTtCACCACC GTCATGCAGA mTCCCM:(al axcTACACCA TCAGCTTCCT 

6421 CGTGTCCCCC MOM ££££££ KGGGGCCAG GGAAAGGGCA 

6481 GCCCCGCTGA OCGAGGG gTO TCCCGGTOAA GGGGGCAGGA TGGCGGAAAG 

^54^- < *'^^* : ^^ ^CTGOCTAGA G^WAGA^AA CAAACAQAAC CGGCTCAGTT CACCTTGATOi 
6601 GAAGGGCCGT GGTOQCTAGA xjULeeCT raC ATTATGCTAT GAAGAGtAGT 

6661 AGGTGCTTCC GAGCTQOGAT JAG^AAGG AAACCCT^C TCAAAAACCA 

5S SS S£ 'c^SSg gggcgttcat gccca^a 
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.^sAGGCT TACAAAGTTG TCTCTGATGT CTCACAAAAC KtMOtGOX 

6841 CACGTGACAA AACXCAGGCT Tic ** *"^ Tafl CCTTCT A. GACACATTCT TAAAAAGCAC 

6901 agaaaatctg cgaacacaga wwwccca CCT^ACAIC CCTCTCCGGG 

6961 cTxrrccrcx cgcaaaacag £££££ actgcacaca tgcccacact 

7021 oacrtcccc agagacctgg gooctggttg ccctoccttc ccagaiaagg 

7081 CTCACCtlCT CAACA1CCTG ^ACXACCCG £££££ SSStG GAOVTACAGG 
7141 OCACXOTAGC CCATTCAGAG TCADCCCAGG GACACAACG* 
7201 GTCAGTCCAT TAACAA 
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CYP2A13 cDNA 

GACIUIX3ATGGTCTK2ATC7nZAGTXnX^ 
GCTCOCnXXGGGAOOCACCOCATIXXXXTr^ 
CACAGAGCAGATXJTACAACIXXCTCATC^ 
GKHTC^OCATICACniGGGGC^^ 

GODGflCAA OCAG GCIClGGIGGAOCAGGCIGAGG 

Gc^GGacAOcnajW^ncQcicn^ 

CWGAGOOCXXXaAGCAGeia^ 
GCXHXjGGCAAGCGCIjGCAroGAGGAAOGCA^ 

AGCEGCACAGICroCAATaro^ 

CTATGAGGACAAAGAGTIlCXTG^ 

CAOnXL\OGGGAA(XntX^OGGGGCAG 

GAAACAOCTGCXIAGGACCACAGCAACAGGCXTI^ 

GGAGGACTTCATtXKXAACL\AG^ 

AATFOOaCACOGGAJCrFCATCCACTCCn^ 

GAAGAACCCCAACACAGACnTCTACTrcAAGAAC^^ 

GAAOJlCllUiriljCXSGQCACIGAGAOCXaGAGCAOCACO^ 

CCTXXSOTCAIGAAGCaGC^^ 

GACAGAJJKzATCXXjCAAGAA 

ATCGOCTACAGAGAGGCAGTCATOCACCA^ 
CTCXniATCGGTITGGCCXIACAGGGT^ 

GAOGCCAGtil IV nVLU ^OOCCC A GGACIOCAaKn^ 
GAGAAGGGG CAGnT AAGAAGAGTOATG Cl 1 rTOTGCCXTnTO CATOGGA 
AAGCGGTACIXjTTITGGAGAAGGOCTGGCCAGAATG 
TCACCA£XATCATGCAGAACTritXXrr^ 
ATOGAOCnXJroOOCCAAACAOCJroGGCTTrGOCADGATOO^^ 

CATGAGcnanaxnoGapc uGmAQGGaG^^ 

AAGAATtKXKjGCAgTIGGflGGAAGflAAfi GGGAGAGO^ 
GAAGAAACAGAAGOGGCIX^GTTCAQCTTtt ATGA^ 

ATGAGAGGAAGGGAAACCTTACAGTATGCTACAAAfiAfl TAGTAATAATA 
GCAGCTCTTATCTOCTGA 3' 
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3421 CCTCCACTTC AGCAICTTCA CCAGCCCCAC TTTATACCTG AGCACCTGAA 

3 iHl CAATCCAGAC CCAOIAAGTA TCT6GACAGC TGTCTCCAAC CAAGTCCACT TGAAKXXTCA 

SS SSSSoI SSgTGCCAC TCACCTCATA CCAGCCCCAC CTGAAGAGCT *A*CACCTGG 

/MnimuuiHn njuwnriL nrvrCTeAAC ACCTAQATGT GTGCTCCAAT 

3601 ACACGTSTCT TCCAACTCAA CTTCACTTGA iSSuSc ACCEAAATTA 

3661 CCAGCCTCAT TTGCATACCT GAAACCTG6A IATAT6CCTC »°™TTCTC J^***™ 
3m CTAQACCOT6 CCCCTGGCAC «^ SS 

"„,, t wgpra um OACCTGCAAG GGCTCGAGGA CTTCATCGCC MGMWnib 

3961 CACAGC A ACA GGCCTXwmu . „- rlr ■ nt -i- i - ig. T CA 

* , msscamcTO GAffCCCAATT CCCCACGGGA CTTCA1CGAC TCCTTTCIGA 

4021 AGCACAACCA waiCflnna « mrennee CAGGGAGAGG 

4081 •TCCOCAMCA GGAGGTACAT CCCAGCACCC AOTGCAGGCA GfflRK»AAGC CAKMM***, 
llS GAAAXCMGA TOGGAGTGGG CT60CCAQAC G AC A C A GGCr CATTCAAATT AGCOCTCOTC 

4201 ATAAIAAJCC OTACAATTGO CAGGAGRCC AGAGCAGCCT GGCCAACAXG 

4261 GGACCCCGAG GCACGTCOAT CACCTGAGGT CAG6AGTTCC *»«AOT 

AMI fflXSUWCCCC GTCTCTACTA AAAATACAAA AATCAGCTAG CTATGGT6GC AKWGCCTGT 

SS SSSgCT ACTCAGGAGG CTGAGACAGA AGAATTTGTO TGAAICCGGG AgCACAOBT 

SS CGGGATCATG CCACTGCACT CCGGCCICAO *»CAGAGCA 

4501 AAAAAAAAAA AAAAAAAAAA AAAAAATTCC AGG1GGATTA GATTCGAAAG 

4561 TCCCATCTAC TGAGCCCTCA CCCACAAGGA C&SCTTOTGG ^SWJ 3 ^ SiSgCCA 

4621 AACTTCTCAA GAACTACCGG QTGCCACGAA CTGGGTTAAO TGTTTTAMA TAGTCCGOCA 

SS J^AACACTT SaACAGTTC TTGAGGGACG TTCACTCATG QCCCCAfilTG TACAJUKTGAC 

SS £££££ — 

4801 TOGTCCCCCT AGCTCAAACC CTGOTCTCTC TCAGCCTATA CCTGOT SCTT TTAHTOM^ 
SS CCGTTCATGT CCIOGTTAGC AGACACACCT CTGTGGACAG CTCACCTGGC 

SS SSSc AGGGTCCCCG CCXACCTCTO GAlCTCAGCC TCCCATGTGG 6AMBCTTTA 

SS SaS»a Scaoggag aaaggatcaa gggagcga** cerecACA GT aaottcaag 

en" »S™flOO AAGAAATAGG AT BC TOCT CC CTAAAATTCT GTGCTTGTAT CTCACAAAAA 

S041 ATITTTAGGG AAB&AATM*« — — ,,, m cT COOCTC 

«D1 gFCCTTTT TT CTOACTCrrC A TCTTQ CCAT CTCTGXACTA CTTTCTCPTC ^«***^ 

SS ^^icr ttccaaatat tcctatcatt aaaaaagtaa cagactggga aacatogcaa 
SS aaSSSS ££a£Sa aSgctogc atggtggtgc atccctgcgg tcccagc^c 

SS SaIgAGCTT GABGTGGGAG GATATCTTGA GCCCAGGOW GGCAGACCT* ^^GCC 
SS SJScACAG CC CT GCO CIC CACCCTGGGT GACAGAATAA GACCgTGTCT CCCAAAAAAA 
SS SaAGAATTA StStSaC AGTTAACAAG TSAOCCTGCA TACTCATCTG »»«GCAGr 
S461 TKAC^CT CTGSACGCTG AGACCGGAGS ATTCCTTGAA CCCACCAOTT «»CTCCAGC 
S£ iA^AOACC AAGTCTGCAT AAAAAAAAAA AAAACCAACT GACAG CT A AG 

SS SSaatS AA^St 0ATCAGTGA6 OIAAAGAAOG TGAGAASSAA OAGCMTWC 

SS SSaaS ccagccaggg caaoogc^g aacctcgagc iSSS 

57fll COTCTTTCC ACCTTTGGTC TGGACCAAAG AGAGQTAGCT CCAAAGQAAA AG^AGAA 
SS SSSS MCMOQAGA GTQAGCTTGG TCTAAACCGC CCTCTCCCTB CAGCAGGAGA 
SS AGaSSS JaSgAGTTC TACTIGAAGA ACC^GTGAT GAOCA«Ta AAC^ 
5Bftl TTGCGGGCAC TGAOACCGTG AGCACCACCC TGCGCTACGG TTTCCTGCTG CTCMR*«^ 

SS SSSgst ggagSag actggaaagg gaggaaagto aaggccccca 

SS aScStoa S?ggtgca gtgtacccac ctatcccaga tcccaggacc ctgagaostc 

11V: ffyrTLrr CAGASACAGG ACAATATTCA GCTGATAGGC ATCAGCTGAG TCTCATTAGC 
6061 CCITGCTGTC CAH«H«tJiuw «»irfM»*~.*«-» , « w ■■..-rr AAGCCCACTG 

B191 TXTTAAAATA TTGAAAATGT CTCCACTGAT TCGTCAGTCA CTC CTGIOT 

mwOTwnSSer tcaitgaasgg gct^akctt cgtgtSgag gtggaagc^a tctcaaccgc 
^ 0 K t SSZ'SSSS SSaa gacccctaga gagggtaaag 

6361 t"™^ S^SS CCAAGGTCCA TGAG6ACATT GACAGAGTGA TCGGCAAGAA 

!n^, ACtCCCTCAA TCAGTCAAAA AAGACTTCCC CAACCACCAC ATCTGTTCCA 

6721 ACICTGTCCC Atrrca-"-«w\ *'-~**~~**~* » i»PTrtCH8 TCAGGAGAAZ 

6781 CCirrCCACT TAGACAGTCC TGAGTCCTGC ATCTCGCCAC ACTCTTTGTC TCAHGW-im^ 
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3421 CCTCCACTTC AGCATCTTCA CCA GCCCC AC TTTATACCTG AGCACCTGAA CAAAAGCCCC 
3481 CAATCCAGAC CCAGTCAAGTA TCTCK5ACAGC TGTCTCCAAC CAAGTCCACT TX^ TOC C TA 
3541 AATACCIAGA CAGGTGCCAC TCACCTCATA CCAGCCCCAC CTGAAGAGCT AAACAUJU S^ 
3601 ACAGGTCTCT TCCAACTCAA CTTCACTTGA A3ATCTGAAC ACCT AQA3FGT <5TGCTOCAAT 
3661 CCAGCCTCAT TTGCATACCT GAAACCTGGA TATATGCCTC AGTTCTTCTC ACCTAAATTA 
3721 CTAGACCGTG CCCCTGGC A C CTAATCCACG MAAAACTTA GATATAASTT T CC ATCCAAC 
3781 CCCACTCAAA TACCTAAACA CCTGGACAOA TGCCTTTAAC TOOC nCeCT CCTTGCTAIG 
3841 AAACAAATCC CCATTCCCAT CAGCTCCTGC CC CGT Q ACAC CTGTCCTTCC CTTTCCATCC 
3901 TCTCTCTGCA ACCOCADCTC TATGAGATGT TC TC TTC GG T GATCA AACAC CTGCCAGGAC 
3961 CACAGCAACA GGCCTTTAAG GAGCTGCAAG GGCTCCAGGA CTTCA TCGCC AAGAACGTCG 
4021 AGCACAACCA GCGCACGCTG GATCCCAATT CCCCACGGGA CCTCATCCAC TCCTTTCTCA 
4081 TCCGCATGCA GGAGGTACAT CCCAGCAGCC AGTGCAGGCA GGTGCAAAGC CA GGGAGAGG 
4141 GAAATGAGGA TGGGAGTGGG CTGGGCAGAC CACAC A GG CC CATTCA AA3T AGCCCTCOTC 
4201 ATAATAATCC TTACAATTGG CCAGGCGCGG TGGCTCATCA CCTCTAATCC CAGCACTTTG 
4261 GGAGGCCGAG GCAGGTGGAT CACCTGAGGT CAGGAGTTCC AGAC CAGKT GG CCA ACATC 
4321 GTGAAACCCC GTCTCTACTA AAAATACAAA AATOAGCTAG GTATGGTGGC ATGCGCCTGT 
4381 AATCCCAGCT ACTCAGGAGG CTGAGACASA AQAATTT GTT TGAATCCGK5 AGGCA GAGGT 
4441 TGCAGTOAGC CGGGATCATG CCACTGCACT CCGGCCTGAG TGACAGAGCA AGAC CCtCTA 
4501 AAAAAAAAAA AAAAAAAAAA AAAAAATTCC GGAAAACCCC AATTACATCA CCCACTGCTG 
4561 TCCCATCTAC 1GAGCCCTCA CCCACAAGGA CGGGTTATGG A GCTGGA OTA GATTCGAAAG 
4621 AACTTCTCAA GAACTACCGG GTGCCAGGAA CTGGGTTAAG TGTTTTATCA TAGTCCGCCA 
4681 TGGAACACTT TTAACAGTTC TTGAGGGAGC TTCACTCATG GCCCCAGTTG TACAAATGAG 
4741 GAAACTGAGG CCCAGAGAGT TTAAGTGTCT TAACTGAGGT CACAACAGTG AGGAAGACCA 
4801 TGGTCCCCCT AGCTCAAACC CTUG TC TCTC TCAGCCTATA GCTGOTGCTT TTAGCCACCA 
4861 TGCTCTCTAA CCGTTCATGT CCTGGTTAGC AGACACACCT CTGTGGACAG GTOACCTGGC 
4921 TTTACATTGC AGGGTCCCCG CCTACCTCTG GAlCTCAGCC TOXATOTGG GA AGGCTT TA 
4981 GGAAGCCAAA GCTCAGGGAG AAAGGATGAA GGGAGGGATT CCTCCACAG T AAGTTTCAAG 
5041 ATTTTTAGGG AAGAAATAGG ATGCTGTTGC TTAAAATTCT QTGCTTGTAT CTCAGAAAAA 
5101 LltaTOW ' CTGACTCTTC ATCTTCCCAT CTCTGTACXA CTTTCTCTTC GTCTCCCCTC 
5161 ATCCTTCTCT TTCCAAATAT TCCTATCATT AAAAAAGTAA CAGACTGGGA AACATGGCAA 
5221 AACCCCCTCT GXACAAAAAA ATGGCTAGGC ATGGTCGTGC ATGCCTGCGG TCCCAGCTAC 
5281 TAAGGAGGTT GAGGTGGGAG GATATCTTGA GCCCAGGGTG GGCAGAGCTT TCAATGAGCC 
5341 GATATCACAG CCCTGCCCTC CAGCCTGGGT GACAGAATAA GACCGTGTCT CCCAA AAA AA 
5401 AAAAGAAOTA ATTTTTTAAC AGTTAACAAG TGAGCCTGCA TAGTCATGTG CATGTGCAGT 
5461 TCCAGCTACT CTGGAGGCTG AGACCGGAGG ATTCCTTGAA CCCAGGAGTT GGAGTCCAGC 
5521 CTCTGCAACT TAGCAAGACC AAGTCTGCAT AAAAAAAAAA AAAACCAACT GACAGCTAAG 
5581 TTOACAATTA AAGGATAGAT GATCAGTGAG GTAAAGAAGG TG AGAAG GAA GA GCAT TTTG 
5641 GGCAAAGCCA GCAGCCAGGG CAAGGGCTGG AACOTGGAGC GAGTTTOGCA AATCTAGGGT 
5701 CCCT C TTT CC ACCTTTOGTC TGGACCAAAG AGAGGTAGCT CCAAAGGAAA AGCCCTAGAA 
5761 GGGCCCCAAG AGCATCGAGA GTGAGCTTGG TCTAAACCGC CCTCTCCCTG CA GGAGGAGA 
5B21 AGAACCCCAA CACAOAGTTC TACTTGAAGA ACCTGGTGAT GACCACCCTG AACCTCTTCT 
5881 TTGCGGGCAC TGAGACCGTG AGCACCACCC TGCCCTACGG TTTCCTGCTG CTCATGAAGC 
5941 ACCCAGAGGT GGAGGGTAAG ACTGGAAAGG GAGGAAAGTG AAGCGCCCCA GACCCTCAAA 
6001 ACTCCCCTCA GCCTGGTGCA GTOIACCCAC CTATCCCAGA TCCCAGGACC CTGAGACGTG 
6061 CCTTGCTGTC CAGAfiACAGG ACAATATTCA GCTGAHAGGC ATC AGCICAG TCTCATTAGC 
6121 TATTAAAATA TTCAAAATCT CTCCACTGAT TGGTCAGTCA CTCCTGTCCC AAGCCCACTG 
6181 AGTGTCCGCT GCCTGCTCCT CTGGATCATC CCCTAAGTTC CTCCCTTGTC CTACCC TOTC 
6241 ATTCTGACAC AACCTGCTTT AACAGGGATC CTGCTCCAAA CAATGGGAAT GGGTG ATGTC 
6301 TTGTTCTTCT •PEATCAATGO GCTTACCCTT CGTGTCAGAG GTGGAAGCTA TCTCAACCX5C 
6361 CGrorXWAG CTAGGGGGGG CGATACATGC CCTGCTCTAA GACCCCTAGA GAGGGTAAAG 
6421 ATATTCCCCT CCTCCGCCAG CCAAGGTCCA TGAGGAGATT GACAGAGTGA TCGGCAAGAA 
64B1 CCGGCAGCCC AAGTTTGAGG ACOGGGCCAA GATGCCCTAC ACAGAGGCAG TGATCCACGA 
6541 GATCCAAAGA TTTGGAGACA TOCTCCCCAT GGGTTTGGCC CACAGGCTCA ACAAGGACAC 
6601 CAAGTTTCGG OATTTCTTCC TCCCTAAGGT GCTGTCTCCC CTCCACCACC A CCACTCAG A 
6661 CTACGGGGAC WCCAGCCTC TCT CTGTG T C CCCAGAATCC TGCCCCCATT A tfiwICTAO 
6721 ACTCTGTCCC ACTCCCTCAA TCAGTGAAAA AAGACTTCCC CAACCACCAC ATCTGTTCCA 
6781 CCTTTCCACT TAGACAOTCC TGAGTCCTGC ATCTCGCCAG ACTCTTTGTG TCAGGAGAAT 
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6841 ACACCCCATG TTCCCAATCT TCCTGTCTTA AGAAACAGAA G^CCTW ^^0000? 
6901 TOKSGCTrac GGACACAAAX CTCAGGTCCC USSS 
6961 CCRTCTCTCC CAAACTTCCT GTCTCAGAQA CATCAAACTT ™CCCC CA J^u^ 
7021 CTCAGABGTC CCCAAC1CCT CCATGTCGTG CCACTCCCCG CACeiCCCCC *CCttMS*EC 
7081 W CCCTGTCTAC -mCACCAAT CCCCCCAACC ^A«J »M« 
' °; XtTuireCCT CCCAGGGCAC TCAAGTCTTC CCTATGCTGG GCTCCGAGCT SAOAGACCCC 

CgACTCCAGT CCCCACCACT 

7«1 TTTAAaAAGA cmCRT *ggggg 
7321 GaCCACGSCT CACACCAGCA GQOOOCTCTC TCACCCACCT CCCCTCTCTG 0G8TGTAGCC 
73B1 TGCTATTTCT CCAGCTI«» ACTTCCICTT AOAATC»CC AWOAGCCGC C»OC»GCTGA 
™\ SSSS ACCCAATACC 1GCGCCCAGG 

7501 Tcccccmo aottatttot ctaggqzcac ACACcAfiArr cttcacctcc ctmwaagga 

7561 GA^ATOWA CACCACTRTft GTCATATTXC CAAQTGTATC 1GGGGGCTAS «8CMCTaX 
S AOTCCCATT GCTACACCKS GCATCGATCA CCCCATCTAT ^ATTATG 

7681 ccSStcoa aacccaiaga actqtataac acacagtoaa ccctaatcia Mcraamc 

7?S JUnuoMf CAATATTGGT *C*=A«« J^TCTT JSStSS 

•rnni AffRMkAfiCTC AGGGAGBATC GGAffirCTCCT CTGAAAGTCT CTCAGGCCAT AATATTCCCA 
™J JSSSS JSSS^G CAGCCGGGGG TCAGTAGGGG mmc 
7921 ggcttcacS tcaccoctcc tocctciccp CCTCAGGAAA CCGSTACXGT tnWACAAG 
79B1 GKieKW AATGGAGCTC TTICTCTTCT TCACCACCAT CATCCAGAAC TTTCGCTTCA 
25 SS^AG GATATCGACQ «T«CCCAA 

8101 TCCCACGAAA CtACACCATO ACCTTCCTGC CCOOCTQAOC OAGGGCTCTG «^tACG 
SSI SgCKGCC G6GGCCAGGG AAACGGCCGO GGCAGGQGCO CGGCTTOTGG ^f^SS 

Jm SSagaatc ggggcagtgg gggaaggaag gggagagotg cttagaogqa acagaagaaa 

SSl CA3AAGGQGC TCA6TTCACC TTGATGATCT CCTTCASAGC TOTGA^ SSSS 
B341 CCTTACAGTA TGCTACAAAG AGTAGTAATA ATA6CAGCTC TTATCXCCTO AACAAPTCCC 
Hoi ££SScAG CTTTOTICAA AAAGCGTTGC ACGCICACCT CAOTATTTC CC^ACACCT 
HA CMcSaTQO GGC3AAAAGTC TTCATTCCCC TTTTTACACG TGAflAAAGGT GCGGCTCAGA 

S£ Stctgaaaa ctcacaaaac gcaagigtcc agacoatcto ggaacacaga 

8641 GCTTAGTATA GCATCACATG GCCTGAACAC CCCTGGGCOG GGGGGTTCCC CRBRWi^iw 

"S SqgSct gccctcccpa ctctgtacac tcgcctactc gggacgatcc gggcaccagg 

8761 CTGTCACCTG AGCTCCCTA 
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